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© Non-human primate CD4 polypeptides and human CD4 molecules capable of being glycosylated. 

© The invention relates to substantially pure non-human primate CD4, and fragments thereof which bind to HIV 
or SIV gp120. The invention also relates to gp120 binding molecules related to human CD4 but which may exist 
in glycosylated form. 

The invention also relates to fusion proteins which comprise the CD4 molecules of the invention, or 
fragments thereof, and an immunoglobulin light or heavy chain, wherein the variable region of the light or heavy 
chain has been replaced with CD4 or fragment thereof which is capable of binding to gpl20. The invention also 
relates to fusion proteins comprising the CD4 molecules of the invention and a cytotoxic polypeptide. 

The invention also relates to an immunoglobulin-like molecules comprising the fusion proteins of the 
invention together with an immunoglobulin light or heavy chain. 

The invention also relates to methods of treating HIV or SIV infection comprising administering the CD4 
molecules of the invention, glycoproteins, fragments thereof, fusion proteins or immunoglobulin-like molecules of 
the invention to an animal. 

The invention also relates to assays for HIV or SIV comprising contacting a sample suspected of containing 
HIV or SIV gp120 with the CD4 molecules of the invention, fragments thereof, glycoproteins, immunoglobulin-like 
molecules, or fusion proteins of the invention, and detecting whether a complex is formed. 

The invention also relates to nucleic acid molecules which specify the proteins, glycoproteins and fusion 
proteins of the invention as well as vectors and transformed hosts. 
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NON-HUMAN PRIMATE CD4 POLYPEPTIDES, HUMAN CD4 MOLECULES CAPABLE OF GLYCOSYLATION, 
FRAGMENTS THEREOF, FUSION PROTEINS THEREOF, GENETIC SEQUENCES, AND THE USE THEREOF 



FIELD OF THE INVENTION 

The Invention Is in the field of recombinant genetics and pharmaceutical compositions. 

5 

BACKGROUND OF THE INVENTION 

The human and simian immunodeficiency viruses HIV and SIV are the causative agents of Acquired 
w Immune Deficiency Syndrome (AIDS) and Simian Immunodeficiency Syndrome (SIDS), respectively. See 
Curren, J. et aL , Science 329 : 1359-1357 (1985); Weiss, R. et aL , Nature 324 :572-575 (1986). The HIV 
virus contains an envelope glycoprotein, gp120 which binds to the CD4 protein present on the surface of 
helper T lymphocytes, macrophages and other cells. Dalgleish et aL Nature , 312 :763 (1984). After the 
gp120 binds to CD4, virus entry is facilitated . by an envelope-mediated fusion of the viral target cell 
75 membranes. 

During the course of infection, the host organism develops antibodies against viral proteins, including 
the major envelope glycoproteins gp120 and gp41. Despite this humoral immunity, the disease progresses, 
resulting in a lethal immunosuppression characterized by multiple opportunistic infections, parasitemia, 
dementia and death. The failure of host anti-viral antibodies to arrest the progression of the disease 

20 represents one of the most vexing and alarming aspects of the infection, and augurs poorly for vaccination 
efforts based upon conventional approaches. 

Two factors may play a role in the inefficacy of the humoral response to immunodeficiency viruses. 
First, like other RNA viruses (and like retroviruses in particular), the immunodeficiency viruses show a high 
mutation rate which allows antigenic variation to progress at a high rate in response to host immune 

25 surveillance. Second, the envelope glycoproteins themselves are heavily glycosylated molecules presenting 
few epitopes suitable for high affinity antibody binding. The poorly antigenic, "moving" target which the viral 
envelope presents, allows the host little opportunity for restricting viral infection by specific antibody 
production. 

Cells infected by the HIV virus express the gp120 glycoprotein on their surface. Gp120 mediates fusion 
30 events among CD4* cells via a reaction similar to that by which the virus enters the uninfected cell, leading 
to the formation of short-lived multinucleated giant cells. Syncytium formation is dependent on a direct 
interaction of the gp120 envelope glycoprotein with the CD4 protein. Dalgleish et aL , supra , Klatzmann, D. 
et aL , Nature 312 :763 (1984); McDougal, J.S. et aL Science , 231^ :382 (1986); Sodroski. J. et aL , Nature , 
3227470 (1986); Lifson, J.D. et aL , Nature , 323 :725 (1988); Sodroski, J. et aL , Nature , 3 21^:412 (1986). 
35 The human CD4 protein consists of a 372 amino acid extracellular region containing four 
immunogfobulin-like domains, a membrane spanning domain, and a charged intracellular region of 40 amino 
acid residues. Maddon, P. et aL , Cell 42 :93 (1985); Clark, S. et aL , Proc. Natl. Acad. ScL (USA) 84 : 1649 
(1987). 

Evidence that CD4-gp120 binding is responsible for viral infection of cells bearing the CD4 antigen 
40 includes the finding that a specific complex is formed between gpl20 and CD4. McDougal et aL , supra . 
Other workers have shown that cell lines, which were non-infective for HIV, were converted to infectable cell 
lines following transfection and expression of the human CD4 cDNA gene. Maddon et aL , Cell 47 :333-348 
(1986). PCT Application Publication Nos. WO 88/01304 (1988) and WO89/01940 (1989) disclose that 
soluble forms of human CD4 comprising the immunoglobulln-tike binding domains are useful for the 
45 treatment or prophylaxis of HIV infections. 

In contrast to the majority of antibody-envelope interactions, the receptor-envelope interaction is 
characterized by a high affinity (Kg = 10 s l/mole) immutable association. Moreover, the affinity of the virus 
for human CD4 is at least 3 orders of magnitude higher than the affinity of human CD4 for its putative 
endogenous ligand, the MHC class II antigens. 
50 A number of workers have disclosed methods for preparing hybrid proteins. For example, Murphy, 
United States Patent 4,675,382 (1987), discloses the use of recombinant DNA techniques to make hybrid 
protein molecules by forming the desired fused gene coding for a hybrid protein of diphtheria toxin and a 
polypeptide ligand such as a hormone, followed by expression of the fused gene. 

Many workers have prepared monoclonal antibodies (Mabs) by recombinant DNA techniques. Mon- 
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oclonal antibodies are highly specific well-characterized molecules in both primary and tertiary structure. 
They have been widely used for in vitro immunochemical characterization and quantitation of antigens. 
Genes for heavy and light chains have"been introduced into appropriate hosts and expressed, followed by 
reaggregation of the individual chains into functional antibody molecules (see, for example, Munro, Nature 
5 312 :597 (1984); Morrison, S.L., S cience 229 :1202 (1985); Oi et al Blotechniques 4 :214 (1986); Wood et 
al ., Nature 314 :446-449 (1985)). Light- and heavy-chain variable regions have beencloned and expressed 
in foreign hosts wherein they maintained their binding ability (Moore et al ., European Patent Application 
0088994 (published September 21, 1983)). 

Chimeric or hybrid antibodies have also been prepared by recombinant DNA techniques. Oi and 

70 Morrison, Biotechniques 4 214 (1986) describe a strategy for producing such chimeric antibodies which 
include a chimeric humanlgG anti-leu3 antibody. 

Gascolgne, N.R.J., et al. . Proc. Natl. Acad. Sci. (USA) 84 :2936-2940 (1987) disclose the preparation of 
a chimeric gene construct containing a T-cell receptor a-chain variable (V) domain and the constant (C) 
region coding sequence of an immunoglobulin 7 2a molecule. Cells transfected with the chimeric gene 

is synthesize a protein product that expresses immunoglobulin and T-cell receptor antigenic determinants as 
well as protein A binding sites. This protein associates with a normal chain to form an apparently normal 
tetrameric (H2L2, where H = heavy and L = light) immunoglobulin molecule that is secreted. 

Sharon^J., et al . ^Nature 309 :54 (1984), disclose construction of a chimeric gene encoding the variable 
(V) region^f a mouse heavy chafn specific for the hapten azophenylarsonate and the constant (C) region of 

20 a mouse kappa light chain (V h Ck). This gene was introduced into a mouse myeloma cell line. The chimeric 
gene was expressed to give a protein which associated with light chains secreted from the myeloma ceil 
line to give an antibody molecule specific for azophenylarsonate. 

Morrison, Science 229 :1202 (1985), discloses that variable light- or variable heavy-chain regions can be 
attached to a non-lg sequence to create fusion proteins. This article states that the potential uses for the 

25 fusion proteins are three: (1) to attach antibody specifically to enzymes for use in assays: (2) to isolate non- 
lg proteins by antigen columns; and (3) to specifically deliver toxic agents. 

Recent techniques for the stable introduction of immunoglobulin genes into myeloma cells (Banerji, J., 
et al Cell 33 :729-740 (1983); Potter, H., et al ., Proc. NatL Acad. Sci. (USA) 81^ :7161-7165 (1984)), 
coupled with detailed structural information, have permitted the use of in vitro DNA methods such as 

00 mutagenesis, to generate recombinant antibodies possessing novel properties. 

PCT Application WO87/02671 discloses methods for producing genetically engineered antibodies of 
desired variable region specificity and constant region properties through gene cloning and expression of 
light and heavy chains. The mRNA from cloned hybridoma B cell lines which produce monoclonal 
antibodies of desired specificity is isolated for cDNA cloning. The generation of light and heavy chain 

35 coding sequences is accomplished by excising the cloned variable regions and ligating them to light or 
heavy chain module vectors. This gives cDNA sequences which code for immunoglobulin chains. The lack 
of introns allows these cDNA sequences to be expressed in prokaryotic hosts, such as bacteria, or in lower 
eukaryotic hosts, such as yeast 

The generation of chimeric antibodies in which the antigen-binding portion of the immunoglobulin is 

40 fused to other moieties has been demonstrated. Examples of non-immunoglobuhn genes fused to anti- 
bodies include Staphylococcus aureus nuclease, the mouse oncogene c-myc . and the Klenow fragment of 
E. coli DNA polymerase I (Neuberger, M.S., et al Nature 312 :604-612 (1984): Neuberger. M.S., Trends in 
Biochemical Science , 347-349 (1985)). European Patent Application 120.694 discloses the genetic 
engineering of the variable and constant regions of an immunoglobulin molecule that is expressed in E. coli 

45 host cells. It is further disclosed that the immunoglobulin molecule may be synthesized by a host cell with 
another peptide moiety attached to one of the constant domains. Such peptide moieties are described as 
either cytotoxic or enzymatic. The application and the examples describe the use of a lambda-like chain 
derived from a monoclonal antibody which binds to 4-hydroxy-3-nitrophenyl (NP) haptens. 

European Patent Application 125,023 relates to the use of recombinant DNA techniques to produce 

so immunoglobulin molecules that are chimeric or otherwise modified. One of the uses described for these 
immunoglobulin molecules is for whole-body diagnosis and treatment by injection of the antibodies directed 
to specific target tissues. The presence of the disease can be determined by attaching a suitable label to 
the antibodies, or the diseased tissue can be attacked by carrying a suitable drug with the antibodies. The 
application describes antibodies engineered to aid the specific delivery of an agent as "altered antibodies." 

55 PCT Application WO83/101533 describes chimeric antibodies wherein the variable region of an 
immunoglobulin molecule is linked to a portion of a second protein which may comprise the active portion 
of an enzyme. 

Boulianne et al ., Nature 312 :643 (1984) constructed an immunoglobulin gene in which the DNA 



3 



EP 0 414 178 A2 



segments that encode mouse variable regions specific for the hapten trinitrophenol (TNP) are joined to 
segments that encode human mu and kappa regions. These chimeric genes were expressed to give 
functional TNP-binding chimeric IgM. 

Morrison et al ., P.N.A.S. (USA) 81 :6851 (1984), disclose a chimeric molecule utilizing the heavy-chain 
variable region" exons of an anti-phosphoryl choline myeloma protein G, which were joined to the exons of 
either human kappa light-chain gene. The genes were transfected into mouse myeloma cell lines, 
generating transformed cells that produced chimeric mouse-human IgQ with antigen-binding function. 

PCT Application Publication No. WO89/02922 (1989), discloses chimeric antibody molecules compris- 
ing human CD4. Such chimeric antibody molecules may be administered to a subject infected with HIV to 
treat the HIV infection. 

Despite the progress that has been achieved on determining the mechanism of HIV infection, a need 
continues to exist for methods of treating HIV viral infections. 

SUMMARY OF THE INVENTION 



The Invention relates to a nucleic acid molecule specifying non-human primate CD4, or an HIV or SIV 
gp120 binding fragment thereof. 

In particular, the invention relates to a nucleic acid molecule specifying rhesus monkey CD4 comprising 
the following ONA sequence: 

1 ATGAACCGGGGAATCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTACTCCCA 
-25 MetAsnArgGlylleProPheArgHlsLeuLeuLeuVaUeuGlnLeuAlaLeuLeuPro 

GCAGTCACCCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGATACAGTGGAACTGACC 120 
AlaValThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 15 



121 TGTACAGCnCGCAGAAGAAGAACACACAA 

16 CysThrAlaSerGlnLysLysAsnThrGlnpheHisTrpLysAsnSerAsnGlnlleLys 
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ATTCTG6GAATTCAGGGTCTCTTCTTAACTAAAGGTCCATCCAAGCT6AGCGATC6TGCT 240 
IleLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAspArgAla 55 

24 1 GACTCAAGAA^U\AGCCTnGGGACCMGGATGCTTTTCCATGATCATCAAGAATCTTAAG 
56 AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerMetnelleLysAsnLeuLys 

10 ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGAACAAGAAGGAGGAGGTGGAATTG 360 

II eGl uAspSerAspThrTyr II eCysGl uVal 61 uAsnLysLysGI uGl uVal Gl uLeu 95 

361 CTGGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTGAGGGGCAAAGCCTGACC 
' 5 96 leuVal PheGl yLeuThrAl aAsnSerAspThrHi sleuteuGI uGlyGl nSerLeuThr 

CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGAAATGTAGGAGTCCAGGGGGT 480 
LeuThrLeuGluSerProProGlySerSerProSerValLysCysArgSerProGlyGly 135 

so 

481 AAAAACATACAGGGGGGGAGGACCATCTCTGTGCCTCAGCTGGAGCGCCAGGATAGTGGC 
1 3 6 LysAsnl 1 eGl nGl yGl yArgThr 1 1 eSerVal ProGl nLeuGl uArgGl nAspSe rGl y 

ACCTGGACATGCACCGTCTCGCAGGACCAGAAGACGGTGGAGTTCAAAATAGACATCGTG 600 
ThrTrpThrCysThrVal SerGI nAspGl nLysThrVal Gl uPheLys IleAspIl eVal 1 75 

30 601 GTGCTAGCTTTCCAGAAGGCCTCCAGCACAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
176 Val LeuAl aPheGl nLysAl aSerSerThrValTyrLysLysGl uGlyGl uGl nVal Gl u 

TTCTCCnCCCACTCGCCTTTACACTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 
35 PheSerPheProLeuAlaPheThrLeuGluLysLeuThrGlySerGlyGluLeuTrpTrp 215 

721 CAGGCGGAGAGGGCCTCCTCCTCCAAGTCTTGGATTACCTTCGACCTGAAGAACAAGGAA 
216 GlnAl aGl uArgAl aSerSerSerLysSerTrpIl eThrPheAspLeulysAsnlysGl u 

40 

GTGTCTGTAAAACGGGTTACCCAGGACCCCAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
Val SerVal LysArgVal ThrGI nAspProLysLeuGI nMetGlyLyslysleuProleu 255 

84 1 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACGCTGGCC 
256 HI sLeuThrLeuProGI nAl aLeuProGl nTyrAl aGlySerGlyAsnLeuThrleuAl a 

50 CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 960 

LeuGluAlaLysThrGlyLysLeuHisGlnGluValAsnLeuValValMetArgAlaThr 295 



25 
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961 CA6TTCCAG6AAAATTT6ACCTGT6AA6T6TGG66ACCCACCTCCCCTAA6CT6ACGCTG 
296 G1 nPheGl nGl uAsnLeuThrCysGl uValTrpGlyProThrSerProlysleuThrLeu 

AGCTTGAAACTGGAGAACAAGGGGGCAACGGTCTCGAAGCAGGCGAAGGCGGTGTGGGTG 1080 
Serleuly s LeuGl uAsnLysGI yAl aThrVal Serly sGl nAl aLy sAl aVal TrpVal 335 

1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTA 
336 LeuAsnProGl uAl aGl yMetTrpGl nCysLeuLeuSerAspSerGlyGI nVal LeuLeu 

GAATCCAACATCAAGGTTGTGCCCACATGGCCCACCCCGGTGCAGCCAATGGCCCTGATT 1 200 
Gl uSerAsn I 1 eLys Val Val ProThrTrpProThrProVal 61 nProMetAl aLeu II e 375 

1201 GTGCTGGGGGGCGTTGCGGGCCTCCTGCTTTTCACTGGGCTAGGCATCTTCTTCTGTGTC 
376 VaUeuGlyGlyValAlaGlyLeuLeuLeuPheThrGlyLeuGlyllePhePheCysVal 

AGGTGCCGGCATCGAAGGCGTCAAGCAGAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCysArgmsArgArgArgGI nAl aGl uArgMetSerGl nil eLys ArgLeuLeuSer 415 



1321 GAAAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATTTGA 1377 
416 Gl uLysLysThrCysGl nCy$ProH1 sArgPheGl nLysThrCysSerProIl eEnd 433 



or a degenerate variant thereof. 

The invention also relates to a nucleic acid molecule specifying a soluble non-human primate CD4 
fragment. In particular, the invention to a soluble rhesus CD4 fragment (domain I) which binds HIV or SIV 
gp120 comprising the following DNA sequence: 

1 ATGAACCGGGGAATCCCTTTTAGGCACTT?CTTCTGGTGCTGCAACTGGCGCTACTCCCA 
-25 MetAsnArgGlylleProPheArnHlsLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 



GCAGTCACCCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGATACAGTGGAACTGACC 1 20 
AlaValThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 15 
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121 76TACAGCTTCGCAGAAGAAGAACACACAATTCCACTGGAAAAACTCCAACCAGATAAAG 
16 Cy$ThrAnSerGlnLysLy$AsnThr61nPheH1$TrpLysAsnSerAsnGlnneLys 

ATTCTGGGAATTCAGGGTCTCT7CTTAACTAAAGGTCCATCCAAGCTGAGCGATCGTGCT 240 
IleLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAspArgAla 55 

241 GACTCMGAAAMGCCTnGGGACCAAGGATGCTTTTCCATGATCATCAAGAATCTTAAG 
56 AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerHetllelleLysAsnLeuLys 

ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGAACAAGAAGGAGGAGGTGGAATTG 360 
II eGl uAspSerAspThrTyr II eCysGl uVal Gl uAsnLysLysGl uGl uVal Gl uLeu 95 

361 CTGGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTT 
96 LeuValPheGlyLeuThrAlaAsnSerAspThrHlsLeuLeu 

or a degenerate variant thereof. 

The invention also relates to a nucleic acid molecule specifying chimpanzee C04. comprising the 
following DNA sequence: 



1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
-25 MetAsnArgGlyValProPheArgHlsLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 120 
Al aAl aThrGI nGlyLysLysVal Val LeuGlyLysLysGlyAspThrVal Gl uLeuThr 1 5 

121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGACAAAG 
16 CysThrAI aSerGl nlysLysSer II eGl nPheHI sTrpLysAsnSerAsnGl nThrLys 

ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGTT 240 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 55 

241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACnTACCCTGATCATCAAGAATCnAAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 
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ATA6AAGACTCAGATACTTACATCTGT6AAGTG6G66ACCAGAA66A6GA66T6CAATT6 360 
I1eG1uAspSerAspThrTyrI1eCy$GluVa1G1yAspG1nLysG1uG1uVa1G1nLeu 95 

361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACC 
96 LeuVal PheGlyLeuThrAl aAsnSerAspThrM sLeuleuGI nGlyGl nSerleuThr 

CTGACCTT6GAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGT 360 
LeuThrLeuGI uSerProProGlySerSerProSerValGI nCysArgSerProArgGly 135 

481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 
136 LysAsnI1eG1nG1yG1yLysThrLeuSerVa1SerG1nLeuGluLeuGlnAspSerG1y 

ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAAGTGGAGTTCAAAATAGACATCGTG 600 
ThrTrpThrCysThrVaVlTeuGl nAsnGl nLysLys Val G1 uPheLys 1 1 eAspIl eVal 1 75 

601 GTGCTAGCTTTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
1 76 Val LeuAl aPheGI nLysAl aSerSer I 1 eVal TyrLysLysGI uGlyGl uGl nVal G1 u 

TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 
PheSerPheProLeuAlaPheThrValGluLysLeuThrGlySerGlyGluLeuTrpTrp 215 

721 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGAGCTGAAGAACAAGGAA 
216 GlnAlaGluArgAlaSerSerSerLysSerTrpIleThrPheAspLeuLysAsnLysGlu 

GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 255 

841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 
256 Hi sLeuThrLeuProGI nAl aLeuProGl nTyrAl aGlySerGlyAsnLeuThrLeuAl a 



CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 840 

LeuGI uAI aLysThrGlyLy sLeuHl sGl nGl uVal AsnleuVal Val NetArgAl aThr 295 

961 CAGCTCCAGAAAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 
296 GInLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 

AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 1080 

SerLeuLysLeuGluAsnLysGluAlaLysValSerLysArgGluLysAlaValTrpVal 335 
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1081 CTGAACCCTGA66CGGGGATGTGGCAGT6TCTGCTGA6TGACTCGGGACAGGTCCTGCT6 
336 LeuAsnProGl uAl aGlyMetTrpGl nCysleuLeuSerAspSerGlyGl nVal LeuLeu 



GAATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCT6ATT 1200 
G1 uSerAsnlleLysVal LeuProThrTrpSerThrProValGlnProMetAl aLeuIle 375 



1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCTTCTTCTGTGTC 
376 ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlynePhePheCysVal 



AGGTGCCGGCACCGAAGGCGCCAAGCACAGCGGATGTCTCAGATCAAGAGACTCCTCAGT' 1320 
ArgCysArgH 1 sArgArgArgGl nAl aGl nArgMetSerGl n 1 1 eLysArgleuleuSe r 4 1 5 

1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATTTGA 1377 
416 GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProIleEnd 433 



or a degenerate variant thereof. 

The invention also relates to a nucleic acid molecule specifying a soluble chimpanzee CD4 (domain I) 
which binds HIV or SIV gp120. comprising the following DNA sequence: 



1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
-25 MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlriLeuAlaLeuLeuPro 

GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 120 

Al aAl aThrGI nGlyLysLysVal Val LeuGlylysLysGlyAspThrVal Gl uLeuThr 1 5 



121 TGTACAGCTTCCCAGAAGAAGAGCATACAAnCCACTGGAAAAACTCCAACCAGACAAAG 
16 CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGlnThrLys 

AnCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGTT 240 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 55 



241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTACCCTGATCATCAAGAATCTTAAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeullelleLysAsnLeuLys 



ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 360 
II eGl uAspSerAspThrTyr II eCysGI uVal GlyAspGl nLysGI uGl uVal Gl nLeu 95 



361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTT 
96 LeuVal PheGlyLeuThrAlaAsnSerAspThrHlsLeuLeu 



or a degenerate variant thereof. 
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The invention also relates to a nucleic acid molecule specifying chimpanzee CD4 with the cytoplasmic 
domain, comprising the following ONA sequence: 



1 ATGAACCG6GGA6TCCCTTT7AGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
-25 MetAsnArgGlyValProPheArgHlsLeuLeuLeuVaUeuGlnLeuAlaLeuLeuPro 



6CAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAA6GGGACACAGTGGAACTGACC 120 
Al aAl aThrGl nGlyLyslysVal Val LeuGlyLysLysGlyAspThrVal Gl uLeuThr 1 5 



1 2 1 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGAYAAAG 
16 CysThrAl aSerGl nLysLysSerll eGl nPheHi sTrpLysAsnSerAsnGl nThrLys 

He 

ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGYT 240 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 55 

Ala 

241 GACTCAAGAAGAAGCCnTGGGACCAAGGAAACTTTMCCCTGATCATCAAGAATCTTAAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 

Pro 

ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 360 
IleGluAspSerAspThrTyrlleCysGluValGlyAspGlnLysGluGluValGlnLeu 95 

361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACC 
96 LeuVal PheGlyLeuThrAl aAsnSerAspThrHI sLeuLeuGl nGlyGl nSerLeuThr 



CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAG7CCAAGGGGT 360 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 135 
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481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCT6GAGCTCCA6GATAGTGGC 
136 LysAsnll eGl nGl yGl yLysThrLeuSerVal SerGl nLeuGl uLeuGl nAspSerGly 

ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAAGTGGAGTTCAAAATAGACATCGTG 600 
ThrTrpThrCysThrVal LeuGl nAsnGI nlysLysVal Gl uPheLys II eAsp II eVal 1 75 

601 GTGCTAGCTnCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
176 Val leuAl aPheGl nLy sAl aSerSer II eVal TyrLy s Ly $G1 uGl yGl uGl nVal Gl u 

TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 
PheSerPheProLeuAlaPheThrValGluLysLeuThrGlySerGlyGluLeuTrpTrp 215 

721 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAA 
216 Gl nAl aGl uArgAl aSerSerSerLy^SerTrplUThrPheAspLeuLysAsnlysGlu 

GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 255 

841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 
256 H1 $LeuThrLeuProGl nAl aLeuProGl nTyrAl aGlySerGlyAsnLeuThrLeuAl a 

CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 840 
LeuGluAlaLysThrGlyLysLeuHlsGlnGluValAsnLeuValValMetArgAlaThr 295 

961 CAGCTCCAGAAAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 
296 GlnLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 

AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 1080 
SerLeuLysLeuGl uAsnLysGl uAl aLysVal SerLysArgGI uLy sAl aVal TrpVal. 335 



1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 
336 LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 



GAATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCTGATT 1200 
Gl uSerAsnl 1 eLysVal LeuProThrTrpSerThrProVal Gl nProMetAl aLeuIl e 375 
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1201 6TGCTGGGGGGCGTCGCCGGCCTCCT6CTTnCAnG6GCTAGGCA7CTTCTTCTGTGTC 
376 ValLeuG1>G1yValAlaGlyLeuLeuLeuPheneG1yLeuGlyIlePhePheCysVal 



AGGTGCCGGCACCGAAGGCGCCAAGCASAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCysArgHlsArgArgArgGlnAlaGlnArgMetSerGlnlleLysArgLeuLeuSer 415 

Glu 

1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTnCAGAAGACATGTAGCCCCATTTGA 1377 
416 GluLysLysThrCysGlnCysProHlsArgPheGlnLysThrCysSerProIleEnd 433 

75 

wherein Y is C or T. 
M is A or C, and 
S is C or Q; 
— or a degenerate variant thereof. 

The invention also relates to a nucleic acid molecule specifying a chimpanzee CD4 fragment 
comprising the following DNA sequence: 

1 ATGAACCGGGGAGTCCCnTTAGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
25 -25 MetAsnArgGlyVal ProPheArgHi sLeuLeuLeuVal LeuGlnLeuAl aLeuLeuPro 

GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC .120 
Al aAl aThrGI nGl yLyslysVal Val LeuGlyLysLysGlyAspThrVal Gl uLeuThr 15 



30 



35 



121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGAYAAAG 
16 CysThrAI aSerGl nLysLysSer II eGl nPheHi sTrpLysAsnSerAsnGl nThrLys 

lie 

ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAA7GATCGCGYT 240 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 55 

Ala 



40 



45 



241 GACTCAAGAAGAAGCCTnGGGACCAAGGAAACTTTMCCCTGATCATCAAGAATCTTAAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 

Pro 

ATAGAAGACTCAGATACnACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 360 
II eGl uAspSerAspThrTyr II eCysGI uVal GlyAspGl nLysGl uGl uVal Gl nLeu 95 

50 361 CTAGTGTTCGGATT6ACTGCCAACTCTGACACCCACCTGCTT 
96 LeuVal PheGlyLeuThrAI aAsnSerAspThrH i sLeuLeu 

wherein Y is C or T, and 
55 M is A or C; 

or a degenerate variant thereof. 

The invention also relates to a nucleic acid molecule specifying a gp120 binding molecule capable of 
glycosylation which is related to human C04 with the cytoplasmic domain, comprising the following ONA 
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sequence: 



1 AT6AACCGGGGA6TCCCTTTTAGGCACTTGCTTCTGGTGCTGCAAC7GGCGCTCCTCCCA 
•25 MetAsnArgGlyValProPheArgH1$LeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAAAAAGGGGATACAGTGGAACTGACC 
Al aAl aThrGI nGlyLyslysVal Val LeuGlylysLysGlyAspThrVal Gl uLeuThr 

121 TGTACAGCTTCCCAGAAGAAGAGCATACAAnCCACTGGAAAAACTCCAACCAGAYAAAG 
16 CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGlnThrLys 

He 

ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGCT 
I 1 eLeuGlyAsnGl nGlySerPheleuThrlysGlyProSerLysLeuAsnAspArgAl a 

241 GACTCAAGAAGAAGCCnTGGGACCAAGGAAACTTCMCCCTGATCATCAAGAATCTTAAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIleHeLysAsnLeuLys 

Pro 

ATAGAAGAcfCAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGAGGTGCAATTG 
I 1 eGl uAspSerAspThrTyrl 1 eCysGl uVal Gl uAspGl nLysGl uGl uVal Gl nLeu 
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361 CTAGTGTTCGGATTGACT6CCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCT6ACC 
96 leuVal PheGl yleuThrAI aAsnSerAspThrHI sLeuLeuGl nGlyGI nSerLeuThr 

CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGT 360 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 135 

481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 
136 LysAsnlleGlnGlyGlyLysThrLeuSerValSerGlnLeuGluLeuGlnAspSerGly 

ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATAGACATCGTG 600 
ThrTrpThrCysThrValLeuGlnAsnGlnLysLysValGluPheLysIleAspIleVal 175 

601 GTGCTAGCTTTCCAGAAGGCCTCCA5CATAGTCTATAAGAAA5AGGGGGAACAGGTGGAG 
176 ValLeuAlaPheGlnLysAlaSerSerlleValTyrLysLysGluGlyGluGlnValGlu 

TTCTCCnCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 
PheSerPheProLeuAl aPheThrVal Gl ulysleuThrGlySerGlyGl uLeuTrpTrp , 215 

721 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGA7CACCTTTGACCTGAAGAACAAGGAA 
216 GlnAlaGluArgAlaSerSerSerLysSerTrpIleThrPheAspLeuLysAsnLysGlu 

GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
ValSerVaUysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 255 

841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 
256 H1 sLeuThrLeuProGl nAl aLeuProGl nTyrAl aGlySerGlyAsnLeuThrLeuAl a 

CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTGGTGGTGATGAGAGCCACT 840 
LeuGl uAl a LysThrGl yLys LeuHi sGl nGl uVal AsnLeuVal Val MetArgAl aThr 295 

961 CAGCTCCAGMAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 
296 Gl nleuGl nLysAsnLeuThrCysGl uValTrpGlyProThrSerProLysLeuMetLeu 

AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 1 080 
SerLeuLysLeuGluAsnLysGluAlaLysValSerLysArgGluLysAlaValTrpVal 335 



56 
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is 



1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 
336 LeuAsnProGluAlaGlyHetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 



GAATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCT6ATT 1200 
GluSerAsnlleLysValLeuProThrTrpSerThrProValGlnProMetAULeuIle 375 



1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCTTCTTCTGTGTC 
376 VaUeuGlyGlyValAlaGlyleuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 



AGGTGCCGGCACCGAAGGCGCCAAGCAGAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCysArgHi sArgArgArgGlnAI aGI uArgMetSerGlnll eLysArgLeuleuSer 415 



20 



1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATTTGA 1377 
416 GluLysLysThrCysGlnCysProHlsArgPheGlnLysThrCysSerProIleEnd 433 

25 

wherein Y is C or T, and 
M is A or C: 

or a degenerate variant thereof; 

with the proviso that both Y is not T and M is not C at the same time. 
30 The invention also relates to a nucleic acid molecule specifying a gp120 binding molecule capable of 
glycosylation which is related to a human CD4 fragment, comprising the following DNA sequence: 

1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCA 
« -25 MetAsnArgGlyValProPheArgHlsLeuLeuLeuVaUeuGlnLeuAlaLeuLeuPro 

GCAGCCACTCAGGGAAAGAAAGTG6TGCTGGGCAAAAAAGGGGATACA6TGGAACTGACC 120 
AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 15 

4Q 



45 



50 



55 
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121 TGTACA6CTTCCCAGAAGAAGAGCATACAATTCCACTG6AAAAACTCCAACCAGAYAAAG 
16 CysThrAl aSerGl nLysLysSer II eGl nPheHI sTrpLysAsnSerAsnGI nThrLys 

lie 

ATTCTGGGAAATCAGGGCTCCTTCnAACTAAAGGTCCATCCAAGCTGAATGATCGCGCT 240 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgAla 55 



241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTCMCCCTGATCATCAAGAATCTTAAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 

Pro 

ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGAGGTGCAATTG 360 
II eGl uAspSerAspThrTyr 1 1 eCy sGI uVal Gl uAspGl nLysGl uGl uVal Gl nleu 95 



361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTT 
96 LeuVaVtheGlyLeuThrAlaAsnSerAspThrHisLeuLeu 

wherein Y is C or T, and 
M is A or C; 

or a degenerate variant thereof; 

with the proviso that both Y is not T and M is not C at the same time. 

The invention also relates to a nucleic acid molecule specifying a fusion protein, comprising 

1) a nucleic acid molecule specifying non-human primate CD4 or fragment thereof which binds HIV or 
SIVgp120, and 

2) a nucleic acid molecule specifying an immunoglobulin light or heavy chain, wherein the nucleic acid 
molecule which specifies, the variable region of said immunoglobulin chain has been replaced with the 
nucleic acid molecule specifying said non-human primate CD4,or fragment thereof. 

The invention also relates to a nucleic acid molecule specifying a fusion protein, comprising 

1) a nucleic acid molecule specifying non-human primate CD4, or fragment thereof which binds HIV or 
SIV gp120. linked to 

2) a nucleic acid molecule specifying a cytotoxic polypeptide. 

The invention also relates to vectors comprising the nucleic acid molecules of the invention. 

The invention also relates to hosts transformed with the vectors of the invention. In particular, the 
invention relates to hosts which express complementary immunoglobulin light or heavy chains together with 
the expression product of said fusion protein nucleic acid molecule to give an immunoglobulin-like molecule 
which binds to HIV or SIV gp120. 

The invention also relates to methods of producing non-human primate CD4 ( or fragment thereof which 
binds to HIV or SIV gp120, which comprises 

cultivating in a nutrient medium under protein-producing conditions, a host strain transformed with a vector 
containing a nucleic acid molecule specifying a non-human primate CD4 or soluble fragment thereof which 
binds HIV or SIV gp120, said vector further comprising expression signals which are recognized by said 
host strain and direct expression of said non-human primate CD4 or fragment thereof, and 
recovering the non-human primate CD4 or soluble fragment thereof so produced. 

The invention also relates to a method of producing a fusion protein comprising non-human primate 
CD4, or fragment thereof which binds to gp120, and an immunoglobulin light or heavy chain, wherein the 
variable region of the immunoglobulin chain has been substituted with non-human primate CD4, or fragment 
thereof which binds to HIV or SIV gp120. which comprises 

cultivating in a nutrient medium under protein-producing conditions, a host strain transformed with a vector 
specifying said fusion protein, said vector further comprising expression signals which are recognized by 
said host strain and direct expression of said fusion protein, and 
recovering the fusion protein so produced. 

In particular, the invention relates to a method of preparing a immunoglobulin-like molecule, wherein 
said host strain is a myeloma cell line which produces immunoglobulin light chains and said fusion protein 
comprises an immunoglobulin heavy chain of the class IgM, IgGI or lgG3, wherein an immunoglobulin-like 
molecule comprising said fusion protein is produced. The invention also relates to a method of preparing an 
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immunoglobulin-like molecule, wherein said host produces immunoglobulin heavy chains of the class IgM, 
lgG1 and lgG3 together with said fusion protein comprising an immunoglobulin light chain to give an 
jmmunoglobulln-like molecule which binds to HIV or SIV gp120. 

The invention also relates to substantially pure non-human primate CD4. In particular, the invention 
relates to substantially pure rhesus CD4 comprising the following amino acid sequence: 

MetAsnArgGly II eProPheArgM sLeuLeuleuVal LeuGl nleuAl aLeuLeuPro 
AlaValThrGlnGlyLysLysValVaUeuGlyLysLysSlyAspThrValGluLeuThr 
CysThrAl aSerGl nLysLysAsnThrGl nPheHI $TrpLysAsnSer AsnGl n II eLys 
II eleuGlyll eGl nGlyLeuPheLeuThrLysGlyProSerlysLeuSerAspArgAl a 
AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerMetllelleLysAsnLeuLys 
IleGluAspSerAspThrTyrlleCysGluValGluAsnLysLysGluGluValGluLeu 
LeuVal PheGlyLeuThrAl aAsnSerAspThrM sLeuLeuGI uGlyGl nSerLeuThr 
LeuThrLeuGl uSerProProGlySerSerProSerVal LysCysArgSerProGlyGly 
LysAsnll eGl nGlyGl yArgf hrlTeSerVal ProGl nLeuGl uArgGl nAspSerGly 
ThrTrpThrCysThrVal SerGI nAspGI nLy sThrVal Gl uPheLys II eAsp II eVal 



ValLeuAl aPheGl nLysAl aSerSerThrValTyrLysLysGl uGlyGl uGl n Val Gl u 
PheSerPheProLeuAl aPheThrleuGl uLysLeuThrGlySerGlyGl uLeuTrpTrp 
Gl nAlaGl uArgAI »SerSerSerLy$SerTrpIleThrPheAspLeuLysAsnLysGl u 
ValSerVaUysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 
HIsLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAsnLeuThrLeuAla 
LeuGl uAl alysThrGl yLysLeuHl sGl nGI uVal AsnLeuVal Val Met ArgAl aThr 
GInPheGlnGluAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuThrLeu 
SerLeuLysLeuGluAsnLysGlyAlaThrValSerLysGlnAlaLysAlaValTrpVal 
LeuAsnProGluAlaGlyHetTrpGlnCysLeuLeuSerAspSerGlyGlnVaUeuLeu 
Gl uSerAsn 1 1 eLysVal Val ProThrTrpProThrProVal Gl nProMetAl aLeu II e 
VaHeuGlyGlyValAlaGlyLeuLeuLeuPheThrGlyLeuGlyllePhePheCysVal 
ArgCysArgHi sArgArgArgGl nAI aGI uArgNetSerGI n 1 1 eLys Argleu LeuSer 
Gl uLysLysThrCysGl nCysProHl sArgPheGl nLysThrCysSerProI 1 e . 

The invention also relates to substantially pure chimpanzee CD4 comprising the following amino acid 
sequence: 
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MetAsnArgGlyValProPheArgHisleuLeuleuVaUeuGlnLeuAlaLeuLeuPro 
A1 aAl aThrGl nGlyLysLysVal Val LeuGlyLyslysGlyAspThrVal G1 uLeuThr 
CysThrAl aSerGl nLysLysSer IleGI nPheHI sTrpLysAsnSerAsnGl nThrLys 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerlysLeuAsnAspArgVal 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 
II eGl uAspSerAspThrTyrll eCysGI uVal GlyAspGI nLysGI uGl uVal G1 nLeu 
LeuVal PheGI yLeuThrAI aAsnSerAspThrHI sLeuLeuGl nGl yGI nSerLeuThr 
LeuThrLeuGl uSerProProGlySerSerProSerVal Gl nCysArgSerProArgGly 
LysAsnll eGl nGlyGlyLysThrLeuSerVal SerGI nLeuGI uLeuGl nAspSerGly 
ThrTrpThrCysThrVaHeuGI nAsnGI nLy sLysVal Gl uPheLy s II eAsp 1 1 eVal 
Val LeuAl aPheGI nLysAI aSerSer II eVal TyrLysLysGI uGl yGI uGl n Val Gl u 
PheSerPheProLeuAl aPheThrVal Gl uLysleuThrGlySerGlyGI uLeuTrpTrp 
GlnAlaGTuArgAlaSerSerSerLysSerTrpIleThrPheAspLeuLysAsnLysGlu 
ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 
Hi sLeuThrLeuProGI nAI aLeuProGI nTyrAl aGl ySerGlyAsnLeuThrLeuAl a 
LeuGluAlaLysThrG1yLysLeuH1sGlnGluValAsnLeuVa1Va1NetArgAlaThr 
GlnLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 

SerLeuLysLeuGI uAsnLysGl uAI aLysVal SerLysArgGI uLysAI aVal TrpVal 
LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 
GluSerAsnlleLysVaUeuProThrTrpSerThrProValGlnProMetAlaLeuIle 
ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 
ArgCysArgHi sArgArgArgGI nAI aGl nArgMetSerGI nlleLysArgLeuLeuSer 
GluLysLysThrCysGlnCysProHlsArgPheGlnLysThrCysSerProIle; or 
the glycosylated derivative thereof. 

The invention also relates to substantially pure non-human CD4 molecule comprising the following 
amino acid sequence: 
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HetAsnArgGlyValProPheArgHlsLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 
Al aAl aThrGl nGlyLyslysVal VaUeuGlyLysLysGlyAspThrValGl uLeuThr 
CysThrAl aSerGl nlys LysSer II eGl nPheM sTrpLysAsnSerAsnGI n-9-Lys 
IleLeuG1yAsnGlnGlySerPheLeuThrLysG1yProSerLysLeuAsnAspArg-#- 
AspSerArgArgSerLeuTrpAspG1nGlyAsnPhe-$-LeuIleI1eLysAsnLeuLys 

1 1 eGl uAspSer AspThrTyr 1 1 eCy sGl uVal G1 yAspGl nLysGl uGl uVal Gl nLeu 
LeuValPheGlyLeuThrAlaAsnSerAspThrHlsLeuLeuGlnGlyGlnSerLeuThr 

LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 
LysAsn 1 1 eGl nGl yGI yLysThrLeuSerVal SerGl nLeuGl uLeuGI nAspSerGly 
ThrTrpThrCysThrVal LeuGI nAsnGl nLysLysVal Gl uPhelys II eAsp II eVal 
Val LeuAl aPheGl nLysAI aSerSerll eVal TyrLysLysGl uGlyGl uGl nVal Gl u 
PheSerPheProLsuAlaPheThrValGluLysLeuThrGlySerGlyGluLeuTrpTrp 
GlnAI aGl uArgAl aSerSerSerLysSerTrpIl eThrPheAspLeuLysAsnlysGl u 
ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 
Hi sLeuThrLeuProGl nAI aLeuProGl nTyrAl aGlySerGlyAsnLeuThrLeuAl a 
LeuGI uAl aLysThrGl yLysLeuHl sGl nGl uVal AsnLeuVal Val HetArgAl aThr 
GlnLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 
SerLeuLys LeuGI uAsnLysGI uAl alys Val SerLysArgGl uLy $A1 aVal TrpVal 
LeuAsnProGl uAl aGl yHetTrpGl nCysLeuLeuSerAspSerGlyGI nVal LeuLeu 
GluSerAsnIleLysValLeuProThrTrpSerThrPrt>ValGlnProH«tAlaLeuIle 
ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 
ArgCysArgHisArgArgArgGI nAI a-X-ArgMetSerGl nlleLysArgLeuLeuSer 



GluLysLy$ThrCy$GlnCy$ProHisArgPheG1nLysThrCysSerProIle, 

wherein 

-@- is Thr or He, 

•4- is Val or Ala, 

-$- is Thr or Pro, and 

-%- is Gin or Glu; or 

the glycosylated derivative thereof. 

The invention also relates to a gp120 binding molecule related to human CD4 comprising the following 
amino acid sequence: 
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. MetAsnArgGlyVal ProPheArgHi sleuleuleuVal LeuGl nLeuAl aLeuLeuPro 
Al aAl aThrGl nGlyLyslysVal Val LeuGlyLysLysGlyAspThrVal Gl uLeuThr 
CysThrAlaSerGlnLysLysSerlleGlnPheHlsTrpLysAsnSerAsnGln-e-Lys 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgAla 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPhe-J-LeuIlelleLysAsnLeuLys 
I 1 eGl uAspSerAspThrTyr II eCysGI uVal Gl uAspGl nLysGI uGl uVal Gl nLeu 
LeuVal PheGlyLeuThrAI aAsnSerAspThrHI sLeuLeuGI nGlyGl nSerLeuThr 
LeuThrleuGl uSerProProGlySerSerProSerVal Gl nCysArgSerProArgGly 
LysAsnll eGl nGlyGlyLysThrLeuSerVal SerGl nLeuGI uLeuGl nAspSerGly 
ThrTrpThrCysThrVaUeuGlnAsnGlnLysLysValGluPheLysIleAspneVal 
Val LeuAl aPheGl nlysAl aSerSerll eVal TyrLysLysGI uGlyGl uGl nVal Gl u 
PheSerPheProLeuAlaPheThrValGluLysLeuThrGlySerGlyGluLeuTrpTrp 
Gl nAl aGI uArgAl aSerSerSerLysSerTrpIleThrPheAspLeuLysAsnLysGl u 
ValSerValLysArgValThrGlnAspProLysLeuGlnHetGlyLysLysLeuProLeu 
HIsLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAsnLeuThrLeuAla 
LeuGluAlaLysThrGlyLysLeuHisGlnGluValAsnLeuValValMetArgAlaThr. 
GlnLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuHetLeu 
SerLeuLysLeuGI uAsnLysGl uAl aLysVal SerLysArgGI uLysAl aVal TrpVal 
LeuAsnProGl uAl aGlyHetTrpGl FiCysLeuLeuSerAspSerG lyGl nVal LeuLeu 
Gl uSerAsnl 1 eLysVal LeuProThrTrpSerThrProVal Gl nProMetAl aLeuIl e 

ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 

ArgCysArgHI sArgArgArgGl nAl aGI uArgMetSerGI nlle LysArgLeuLeuSer 
GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProIle, 

wherein 

-@- is Thr or lie, and 

-$- is Thr or Pro; or 

the glycosylated derivative thereof; 

with the proviso that at least one of -@- and -$- is Thr. 

The invention also relates to non-human primate CD4 fragments which binds to HIV or SIV gp120. 
Preferably, such non-human primate CD4 fragments are soluble in aqueous solution. 

In particular, the invention relates to a soluble C04 fragment which is derived from the rhesus monkey 
and comprises the following amino acid sequence: 
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HetAsnArgGl y II eProPheArgM sLeuLeuLeuVal LeuGl nLeuAl aLeuLeuPro 
A1 aVal ThrGI nGlyLysLysVal Val LeuGlyLysLysGlyAspThrVal G1 uLeuThr 
CysThrAl aSerGl nLysLys AsnThrGl nPheHi sTrpLy sAsnSerAsnGl nl 1 eLy $ 

8 lleLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysleuSerAspArgAla 
AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerMet Hell eLy sAsnLeuLys 
IleGluAspSerAspThrTyrlleCysGluValGluAsnLysLysGluGluValGluLeu 

to LeuValPheGlyLeuThrAlaAsnSerAspThrHlsLeuleu. 

The invention also relates to a soluble chimpanzee CD4 fragment comprising the following amino acid 
sequence: 

T5 

HetAsnArgGlyValProPheArgHisLeuLeuLeuVaHeuGlnLeuAlaLeuLeuPro 
Al aAl aThrGl nGlyljsLysVal Val LeuGlyLysLysGlyAspThrVal G1 uLeuThr 
so CysThrAlaSerGlnLysLysSerI1eGlnPheH1sTrpLysAsnSerAsnG1nThrLys 
II eLeuGlyAsnGI nGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 
IleGluAspSerAspThrTyrlleCysGluValGlyAspGlnLysGluGluValGlnLeu 
LeuValPheGlyLeuThrAlaAsnSerAspThrHlsLeuLeu. 



The invention also relates to a gp120 binding molecule capable of glycosylate comprising the 
following amino acid sequence: 

30 

HetAsnArgGl y Val ProPheArgHI sLeuLeuLeuVal LeuGl nLeuAl aLeuLeuPro 
Al aAl aThrGl nGlyLysLysVal Val LeuGlyLysLysGlyAspThrVal Gl uLeuThr 
CysThrA1aSerGlnLysLysSerIleGlnPheH1sTrpLysAsnSerAsnGln-e-Lys 

35 

IleLeuGlyAsnGlnGlySerPheUuThrLysGlyProSerLysLeuAsnAspArg-*- 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPhe-S-LeuIlelleLysAsnLeuLys 
1 1 eGl uAspSerAspThrTyr II eCysGl uVal Gl yAspGl nLysGl uGl uVal Gl nLeu 
40 LeuVal PheGl yLeuThrAl aAsnSerAspThrHi sLeuLeu 

wherein 

-@- is Thr or lie, 
^ is Val or Ala, and 

-$- is Thr or Pro; or 

the glycosylated derivative thereof. 

The invention also relates to gp120 binding molecule capable of glycosylate related to human CD4 

fragments. In particular, the invention relates to a glycosylated human CD4 fragment comprising the 
so following amino acid sequence: 



55 
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HetAsnArgGly Val ProPheArgHi sleuLeuLeuVal LeuGl nLeuAl aLeuLeuPro 

Al aAl aThrGI nGlyLysLysVal Val LeuGlyLysLysGlyAspThrVal G1 uLeuThr 
CysThrAlaSerGlnLysLysSerIleGlnPheHisTrpLysAsnSerAsnGln-8-Lys 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgAU 
AspSerArgArgSerLeuTrpAspG1nG1yAsnPhe-$-LeuIleI1eLysAsnLeuLys 
II eGl uAspSerAspThrTyr I 1 eCy sGl uVal Gl uAspGI nLy sGl uGl uVal G1 nLeu 
LeuVal PheGlyLeuThrAI aAsnSerAspThrW sLeuLeu 

wherein 

-@- is Thr or lie, and 

-$- fs Thr or Pro; or 

the glycosylated derivative thereof; 

with the provisoTnat at least one of -@- and -$- is Thr. 

The invention also relates to fusion proteins, comprising non-human primate CD4 or gp120 binding 
molecules of the invention, or HIV or SIV binding fragments thereof, linked to a cytotoxic polypeptide. 

The invention also relates to a fusion protein comprising non-human primate CD4 or gp120 binding 
molecules of the invention, or fragments thereof which are capable of binding to HIV or SIV gp120, fused at 
the C-terminus to a second protein which comprises an immunoglobulin heavy chain of the class IgM, IgGl 
or lgG3, wherein the variable region of said heavy chain immunoglobulin has been replaced with CD4, or 
HIV gp120-binding fragment thereof. 

The invention also relates to an immunoglobulin-like molecule, comprising: 

(1) a fusion protein of non-human primate CD4 or fragment thereof which binds to HIV or SIV gp120 and 
an immunoglobulin heavy chain, linked to 

(2) an immunoglobulin light chain. 

The invention also relates to a fusion protein comprising non-human primate CD4 or gp120 binding 
molecules of the invention, or fragment thereof which binds to HIV or SIV gp120, fused at the C-terminus to 
a second protein comprising an Immunoglobulin light chain where the variable region has been deleted. 

The invention also relates to an Immunoglobulin-like molecule comprising: 

1) a fusion protein of non-human primate CD4 or gp120 binding molecule of the invention, or fragment 
thereof which binds to HIV or SIV gp120, and an immunoglobulin light chain, Jinked to 

2) an immunoglobulin heavy chain. 

The invention also relates to pharmaceutical compositions, comprising 

1 ) a therapeutically effective amount of a non-human primate CD4, and 

2) a pharmaceutical^ acceptable carrier. 

The invention also relates to pharmaceutical compositions, comprising 

1) a therapeutically effective amount of a soluble non-human CD4 fragment, and 

2) a pharmaceutical^ acceptable carrier. 

The invention also relates to pharmaceutical compositions comprising the proteins, glycoproteins, fusion 
proteins and immunoglobulin-like molecules of the invention. 

The invention also relates to complexes between the substantially pure non-human primate CD4 and 
HIV or SIV gp120. 

The invention also relates to complexes comprising the non-human primate CD4 fragments of the 
invention and HIV or SIV gp120. 

The invention also relates to complexes comprising the fusion proteins and immunoglobulin-like 
molecules of the invention and HIV or SIV gp120. 

The invention also relates to complexes between the gp120 binding molecules capable of glycosylation 
and HIV or SIV gp1 20. 

The invention also relates to a method of treating HIV or SIV infections, comprising administering to an 
animal in need of such treatment a therapeutically effective amount of substantially pure non-human primate 
CD4, or a soluble fragment thereof. 

The invention also relates to a method of treating HIV or SIV infections, comprising administering to an 
animal in need of such treatment a therapeutically effective amount one of the fusion proteins of the 
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invention. 

The invention also relates to a method of treating HIV or SIV infections, comprising administering to an 
animal In need of such treatment a therapeutically effective amount one of the immunoglobulin-like 
molecules of the invention. 

The invention also relates to a method of treating HIV or SIV infections, comprising administering to an 
animal in need of such treatment a therapeutically effective amount of the gp120 binding molecules of the 
invention. 

The invention also relates to a method for the detection of HIV or SIV gp120 in a sample, comprising: 

(a) contacting a sample suspected of containing HIV or SIV gp120 with the fusion protein or 
immunoglobulin-like molecule of the invention; and 

(b) detecting whether a complex is formed. 

The invention also relates to a method for the detection of HIV or SIV gp120 in a sample, comprising 

(a) contacting a sample suspected of containing HIV or SIV gp120 with substantially pure non-human 
primate CD4, or fragment thereof which binds to HIV or SIV gp120. and 

(b) detecting whether a complex has formed. 

The invention is related to the discovery that non-human primates have CD4 of differing amino acid 
sequence than human CD4. The invention is also related to the discovery that when non-human primate 
CD4 is expressed on the surface of human cells, strikingly fewer multinucleated giant cells, or syncytia, are 
formed than when human CD4 is expressed on the surface of the celE The invention is also related to the* 
discovery that the presence of a glycine residue at position 87 in the non-human primate CD4 derived from 
the chimpanzee, instead of the glutamic acid residue as found in human CD4 t is responsible for the lack of 
syncytia formation. As a result, the C04 molecule derived from the chimpanzee can now be used in 
therapeutic application without the potential of causing syncytia formation. 

The invention is also related to the unexpected discovery that chimpanzee CD4 contains two glycosyla- 
te sites (positions 32 and 66 (ASN)). This discovery allows for the preparation of glycosylated gp120 
binding molecules and fragments thereof which bind to gp120 and likely have enhanced stability in vivo . 
Advantageously, the glycosylated gpl20 binding molecules and fragments thereof may be administered 
less frequently to an animal than human or other primate CD4 molecules which are not glycosylated. Thus, 
the invention also relates to, primate (including human) CD4 molecules having one or more glycosylation 
sites, for example, the chimp sequence at amino acid reidues 34 and 68, at 34 only, and at 68 only. The 
invention also relates to other CD4 molecules with glycosylation sites at different positions, so long as the 
molecule retains binding to gpl20. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 



THe invention is directed to nucleic acid molecules specifying non-human primate CD4, HIV gp120 
binding fragments thereof. HIV gp120 binding soluble fragments thereof, fusion proteins thereof, and 
immunoglobulin-like molecules. The invention also relates to gp120 binding molecules capable of being 
glycosylated. HIV gp120 binding fragments thereof, fusion proteins thereof, and immunoglobulin-like 
molecules thereof. The nucleic acid molecules of the invention may be a DNA or RNA molecule. 

By the term "soluble" is intended that the CD4 fragment is soluble in aqueous solutions which include, 
but are not limited to, detergent-free aqueous buffers and body fluids such as blood, plasma and serum. 

The invention is also directed to the expression of these novel nucleic acid molecules in transformed 
hosts to give proteins and glycoproteins. The invention also relates to the use of these proteins and 
glycoproteins to treat and diagnose HIV infections. 

In particular, the invention relates to expressing said nucleic acid molecules, which specify a fusion 
protein comprising an immunoglobulin light or heavy chain, in mammalian hosts which express complemen- 
tary light or heavy chain immunoglobulins to give an immunoglobulin-like molecule which binds to HIV or 
SIV gp120. 

The CD4 proteins, glycoproteins. CD4 fragments, gp120 binding molecules, fusion proteins and 
immunoglobulin-like molecules of the invention may be administered to an animal for the purpose of 
treating HIV or SIV infections. By the terms "HIV infections" is intended the condition of having AIDS, AIDS 
related complex (ARC) or where an animal harbors the AIDS virus, but does not exhibit the clinical 
symptoms of AIDS or ARC. By the terms "SIV infections" is intended the condition of being infected with 
simian immunodeficiency virus. 

By the term "animal" is intended all animals which may derive benefit from the administration of the 
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CD4 proteins, glycoproteins, CD4 fragments, gp120 binding molecules, fusion proteins and Immunoglobulin- 
like molecules of the invention, foremost among such animals are humans, however, the invention is not 
intended to be so limited. 

By the term "fusion protein" is intended a fused protein comprising a CD4 molecule of the Invention, or 
fragment thereof which is capable of binding to gp120, linked at its Oterminus to an immunoglobulin chain 
wherein a portion of the N-terminus of the immunoglobulin is replaced with non-human primate CD4. 
Alternatively, the CD4 molecule or fragment thereof may be linked to a cytotoxic polypeptide such as ricin 
or diphtheria toxin. 

By the term "non-human primate" is intended any member of the suborder Anthropoidea except for the 
family Hominidaa Such non-human primates include the superfamiiy Ceboidea, family Cebidae (the New 
World monkeys including the capuchins, howlers, spider monkeys and squirrel monkeys) and family 
Callithricidae (including the marmosets); the superfamiiy Cercopithecoidea. family Cercopithecidae 
(including the macaques, mandrills, baboons, proboscis monkeys, mona monkeys, and the sacred hanuman 
monkeys of India); and superfamiiy Hominoidae, family Pongidae (including gibbons, orangutans, gorillas, 
and chimpanzees). The rhesus monkey is one member of the macaques. 

The nucleic acid molecules and proteins of the invention may be prepared according to the methods 
disclosed herein and according to well known methods of solid phase synthesis using the amino acid and 
DNA sequences disclosed herein. 

As described more fully in the examples below, the gly residue at position 87 of the CD4 derived from 
the chimpanzee differs from the Glu residue present in human CD4 which is responsible for syncytium 
formation. This discovery allows for the preparation of new CD4 molecules which do not mediate syncytium 
formation. An example of such a protein related to the chimpanzee CD4 molecule comprises the following 
amino acid sequence: 

MetAsnArgGly Val ProPheArgHi sLeuLeuLeuVal LeuGl nleuAl aLeuLeuPro 
AlaAlaThrGlnElyLysLysValVaUeuGlyLysLysGlyAspThrValGluLeuThr 
Cy sThrAl aSerGl nly sLysSer II eGl nPheHi sTrpLysAsnSerAsnGl n-e-lys 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArg-l- 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPhe-S-LeuIlelleLysAsnLeuLys 
II eGl uAspSerAspThrTyrll eCysGI uVal GlyAspGl nLysGI uGl uVal G1 nLeu 
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LeuVal PheGlyLeuThrAl aAsnSerAspThrHI sLeuLeuGl nGlyGI nSerLeuThr 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 
LysAsnI1eG1nG1yG1yLy$ThrLeuSerVa1SerGlnLeuG1uLeuG1nAspSerG1y 
ThrTrpThrCysThrVal LeuGI nAsnGI nLysLysVal G1 uPheLys II eAspIl eVal 
Val LeuAl aPheGI nLysAl aSerSerll eValTyrLysLysGl uGlyGI uGl nVal G1 u 
PheSerPheProLeuAl aPheThrValGluLysLeuThrGlySerGlyGI uLeuTrpTrp 
G1 nAI aGI uArgAl aSerSerSerLysSerTrpIleThrPheAspleulysAsnlysGl u 
Val SerVal LysArgVal ThrGl nAspProLysleuGI nNetGlyLysLysLeuProLeu 
H1sLeuThrLeuProG1nA1aLeuProG1nTyrAlaGlySerG1yAsnLeuThrLeuA1a 
LeuGluAlaLysThrG1yLysLeuH1sG1nG1uVa1AsnLeuVa1ValHetArgAlaThr 
Gl nLeuGl nLysAsnLeuThrCysGl uVal TrpGlyProThrSerProLysLeuMetLeu 
SerLeuLysLeuGluAsnLysGluAlaLysValSerLysArgGluLysAlaValTrpVal 
LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 
Gl uSerAsn II e LysVal LeuProThrTrpSerThrProVal Gl nProMet Al aLeuIle 
ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 
ArgCysArgHi sArgArgArgGl nAI a-%- ArgMetSerGl nil eLysArgLeuleuSer 
GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProIle, 

wherein 

-@- is Thr or lie, 
-#- is Val or Ala, 

is Thr or Pro, and 
-%• is Gin or Glu, 

or the glycosylated derivative thereof. 

The recombinant DNA molecules which encode this family of proteins and glycoproteins have the 
following sequence: 

1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 
121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGAYAAAG 
ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGYT 
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24 1 6ACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTWCCCT6ATCATCAAGAATCTTAAG 

ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGG6ACCAGAAGGAG6AGGTGCAATTG 
361 CTAGTGTTCGGAT7GACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCC7GACC 

CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGT 
481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 

ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAAGTGGAGT7CAAAA7AGACATCGTG 
601 GTGCTAGCTTTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 

TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 
72 1 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGA7CACCTT7GACCTGAAGAACAAGGAA 

GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 
84 1 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 

CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 
961 CAGCTCCAGAAAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 

AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 
1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 

6AATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCTGATT 
1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCT7CTTCTGTGTC 

AGGTGCCGGCACCGAAGGCGCCAAGCASAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 
1321 GAG AAG AAGAC CTGC C AGTGC C CTCAC CGGTTTC AGAAGACATGTAGC C C C ATTTG A 

wherein Y is C or T, 
W is A or C, and 
S is C or G; 

or a degenerate variant thereof. 

In general, for the preparation of fusion proteins comprising an immunoglobulin, that portion of 
immunoglobulin which is deleted is the variable region. The fusion proteins of the invention may also 
comprise immunoglobulins where more than just the variable region has been deleted and replaced with the 
CD4 molecule or HIV gp120 binding fragment thereof, for example, the V H and CHI regions of an 
immunoglobulin chain may be deleted. In practice, any amount of the H-terminus of the immunoglobulin 
heavy chain can be deleted as long as the remaining fragment mediates cell death by antibody effector 
function or other mechanism. The minimum sequence required for binding complement encompasses 
domains CH2 and CH3. Joining of Fc portions by the hinge region is advantageous for increasing the 
efficiency of complement binding. 

The CD4 molecules of the invention and fusion proteins thereof may comprise the complete CD4 
sequence, the 372 amino acid extracellular region and the membrane spanning domain, or just the 
extracellular region. Moreover, the fusion proteins may comprise fragments of the extracellular region which 
retains binding to HIV gp120. The extracellular domain of CD4 consists of four contiguous regions each 
having amino acid and structural similarity to the variable and joining (V-J) domains of immunoglobulin light 
chains as well as related regions in other members of the immunoglobulin gene superfamlly. These 
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structurally similar regions of CD4 are termed the V tl V 2| V 3 and V* domains. See PCT Application 
Publication Number WO 89/02922 (published October 3, 1988). Thus, the non-human primate CD4 and 
fusion proteins thereof may comprise any combination of such binding regions. In general, any fragment of 
the CD4 proteins and glycoproteins of the invention may be used as long as they retain binding to gp120. 

s Gp120 binding CD4 fragments may be obtained by cutting the DNA sequence which encodes 
chimpanzee CD4 at the Nhe site at position 603 (to give a molecule which encodes two binding domains) or 
the BspM1 site at position 405 (to give a molecule which encodes one domain). Alternatively, the DNA 
molecule encoding rhesus CD4 may be cut at the Nhe site at position 603 (to give a molecule which 
encodes two domains) or the 6spM1 site at position 405 (to give a molecule which encodes one domain). 

to Other fragments may be obtained using, for example, an exonuclease. The DNA fragment can then be 
incorporated Into a cloning vector and introduced into a host, followed by screening the transformed host for 
the presence of a protein which binds gp120. Methods for screening clones for specific binding activity are 
well known to those of ordinary skill in the art. Preferably, such CD4 fragments are soluble in aqueous 
solution. 

is Where the fusion protein comprises an immunoglobulin light chain, it Is necessary that no more of the 
Ig chain be deleted than is necessary to form a stable complex with a heavy chain Ig. In particular, the 
cysteine residues necessary for disulfide bond formation must be preserved on both the heavy and light 
chain moieties. 

When expressed in a host, e.g., a mammalian celirthe fusion protein may associate witfTother light or 

20 heavy Ig chains secreted by the cell to give a functioning immunoglobulin-like molecule which is capable of 
binding to gp120. The gp120 may be in solution, expressed on the surface of infected cells, or may be 
present on the surface of the HIV virus itself. Alternatively, the fusion protein may be expressed in a 
mammalian cell which does not secrete other light or heavy Ig chains. When expressed under these 
conditions, the fusion protein may form a homodimer. 

25 Genomic or cDNA sequences may be used in the practice of the invention. Genomic sequences are 
expressed efficiently in myeloma cells, since they contain native promoter structures. 

The constant regions of the antibody cloned and used in the chimeric immunoglobulin-like molecule 
may be derived from any mammalian source. They may be complement binding or ADCC active. The 
constant regions may be derived from any appropriate isotype, including lgG1, igG3, or IgM. 

30 The joining of various DNA fragments, is performed in accordance with conventional techniques, 
employing blunt-ended or staggered-ended termini for ligation, restriction enzyme digestion to provide 
appropriate termini, filling in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and ligation with appropriate ligases. The genetic construct may optionally encode a 
leader sequence to allow efficient expression of the fusion protein. For example, the leader sequence 

35 utilized by M addon et al. , Cell 42 :93-104 (1985) for the expression of human CD4 may be used. 

For cDNA isolationTcDNA libraries may be screened, for example, by use of a complementary probe or 
by assay for the expressed CD4 molecule of the invention using a CD4-specific antibody. Methods for 
preparing antibodies by immunizing animals with an antigen are taught, for example, by Kohler and Milstein, 
Nature (London) 256 :495 (1975); Kohler et al Eur. J. Immunol. 6 :511 (1976); Kohler et aL . Eur. J. 

40 Immunol. 6 :292 (1976); or Hammerling et al ., in: Monoclonal Antibodies and T-Cell Hybridomas , Elsevier, 
N.V., pp.563-681 (1981). The invention further relates to monoclonal and polyclonal antibodies which are 
specific for the non-human CD4 proteins, glycoproteins of the invention, and the soluble and non-soluble 
fragments thereof. 

The non-human primate CD4 may be derived from any member of the suborder Anthropoidea except 
45 for the family Hominidae. Preferably, the non-human primate CD4 is derived from the rhesus monkey or 
chimpanzee, although the invention is not intended to be so limited. One of ordinary skill in the art can 
obtain tee CD4 from any additional primate by isolation of the poly-A containing RNA of mitogen stimulated 
peripheral blood mononuclear cells obtained from the particular animal. After preparation of cDNA with, for 
example, reverse transcriptase, the cDNA may be ligated into an appropriate cloning vector and used to 
so transform an appropriate host. The clones may then be screened with a monoclonal antibody directed to 
the rhesus monkey or chimpanzee CD4 of the invention followed by selection of positive clones, or by 
hybridization with the chimp or rhesus CD4 cDNAs. 

To express the CD4 molecules and fusion hybrid proteins of the invention, transcriptional and 
translational signals recognized by an appropriate host element are necessary. Eukaryotic hosts which may 
55 be used include mammalian cells capable of culture In vitro , particularly leukocytes, more particularly 
myeloma cells or other transformed or oncogenic lymphocytes, e.g., EBV-transformed cells. Advanta- 
geously, mammalian cells are used to express the glycosylated CD4 proteins. Alternatively, non-mammalian 
cells may be employed, such as bacteria, fungi, e.g., yeast, filamentous fungi, or the like. 
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Preferred hosts for fusion protein production are mammalian cells, grown in vitro in tissue culture or in 
vivo in animals. Mammalian cells provide post translational modification to immunoglobulin protein mol- 
icules which provide for correct folding and giycosylation of appropriate sites. Mammalian cells which may 
be useful as hosts include cells of fibroblast origins such as VERO or CH0-K1 or cells of lymphoid origin, 

5 such as the hybridoma SP2/0-AQ14 or the myeloma P3x63Sgh, and their derivatives. For the purpose of 
preparing an immunoglobulin-like molecule, a plasmid containing a gene which encodes a heavy chain 
immunoglobulin, wherein the variable region has been replaced with one of the CD4 molecules of the 
invention, may be introduced, for example, into J558L myeloma cells, a mouse plasmacytoma expressing 
the lambda-1 light chain but which does not express a heavy chain (see Oi et ah , P.N.A.S. (USA) 80 :825- 

io 829 (1983)). Other preferred hosts include COS cells, BHK cells and hepatoma cells. 

The constructs may be joined together to form a single DNA segment or may be maintained as 
separate segments, by themselves or in conjunction with vectors. 

Where the protein is not glycosylated, any host may be used to express the protein which is compatible 
with replication and transcription of sequences In the expression plasmid. In general, vectors containing 

75 replication and transcription controlling sequences are derived from species compatible with a host cell are 
used in connection with the host. The vector ordinarily carries a replication origin, as well as specific genes 
which are capable of providing phenotypic selection in transformed cells. The expression of the non-human 
primate CD4 molecules and fusion proteins can also be placed under control with other regulatory 
sequences which may be homologous to" the organism in its untransformed state. For example, lactose- 

20 dependent E. coil chromosomal DNA comprises a lactose or lac operon which mediates lactose utilization 
by elaboratinglhe enzyme beta-gaiactosidase. The lac control elements may be obtained from bacterial 
phage lambda placs, which is infective for E. coii . The lac promoter-operator system can be induced by 
IPTG. 

Other promoters/operator systems or portions thereof can be employed as well. For example, colicin 

25 E1 , galactose, alkaline phosphatase, tryptophan, xylose, tax , and the like can be used. 

For mammalian hosts, several possible vector systems are available for expression. One class of 
vectors utilize DNA elements which are derived from animal viruses such as bovine papilloma virus, 
polyoma virus, adenovirus, vaccinia virus, baculovirus, retroviruses (RSV, MMTV or MOMLV), or SV40 virus. 
Cells which have stably integrated the DNA into their chromosomes may be selected by introducing one or 

30 more markers which allow selection of transfected host cells. The marker may provide for prototropy to an 
auxotrophic host, biocide resistance, e.g., antibiotics, or heavy metals such as copper or the like. The 
selectable marker gene can be either directly linked to the DNA sequences to be expressed, or Introduced 
into the same cell by cotransformatlon. Additional elements may also be needed for optimal synthesis of 
mRNA. These elements may include splice signals, as well as transcriptional promoters, enhancers, and 

35 termination signals. The cDNA expression vectors incorporating such elements includes those described by 
Okayama, H., MoL CeL BioL . 3 :280 (1983) and others. 

Once the vector or DNA sequence containing the constructs has been prepared for expression, the 
DNA constructs may be introduced to an appropriate host. Various techniques may be employed, such as 
protoplast fusion, calcium phosphate precipitation, electroporation or other conventional techniques. After 

40 the fusion, the cells are grown lin media and screened for the appropriate activity. Expression of the gene(s) 
results in production of the desired protein. If the expressed product is a fusion protein, it may then be 
subject to further assembly with an immunoglobulin light or heavy chain to form an immunoglobulin-like 
molecule. 

The host cells for CD4 protein and glycoprotein, CD4 fragment, and immunoglobulin production may be 
45 immortalized cells, primarily myeloma or lymphoma cells. These cells may be grown in appropriate nutrient 
medium in culture flasks or injected into a synergistic host, e.g„ mouse or a rat, or immunodeficient host or 
host site, e.g., nude mouse or hamster pouch. In particular, the ceils may be introduced into the abdominal 
cavity of an animal to allow production of ascites fluid which contains the immunoglobulin-like. molecule. 
Alternatively, the celts may be injected subcutaneously and the chimeric antibody is harvested from the 
so blood of the host. The cells may be used in the same manner as hybridoma cells. See Diamond et ah , N. 
Eng. J. Med. 304 :1344 (1981), and Kennatt. McKearn and Bechtol (Eds.). Monoclonal Antibodies: 
Hyb~rid~omaiT ^ A New Dimension in Biologic Analysis , Plenum, 1980. 

The CD4"proteins. glycoproteins, CD4 fragments, fusion proteins and immunoglobulin-like molecules of 
the invention may be isolated and purified in accordance with conventional conditions, such as extraction, 
55 precipitation, chromatography, affinity chromatography, electrophoresis or the like. For example, the CD4 
proteins, glycoproteins and fragments may be purified by passing a solution thereof through a column 
having gpl20 immobilized thereon (see U.S. patent No. 4,725,669). The bound CD4 molecule may then be 
eluted by treatment with a chaotropic salt or by elution with aqueous acetic acid (1M). 
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The Ig fusion proteins may be purified by passing a solution containing the fusion protein through a 
column which contains immobilized protein A or protein G which selectively binds the Fc portion of the 
fusion protein. See, for example, Reis, K.J., et aL , J. Immunol 132 :3098-3102 (1984); PCT Application, 
Publication No. W087/00329. The chimeric antibody may the be eluted by treatment with a chaotropic salt 
or by elution with aqueous acetic acid (1 M ). 

Alternatively the non-human primate CD4 proteins and glycoproteins, fragments, fusion proteins and 
immunoglobulin-like molecules may be purified on anti-CD4 antibody columns, or on anti-immunoglobulln 
antibody columns to give a substantially pure protein. 

By the term "substantially pure" is intended that the protein is free of the impurities that are naturally 
associated therewith. Substantial purity may be evidenced by a single band by electrophoresis. 

In one embodiment of the invention, cDNA sequences which encode the CD4 molecules of the 
invention, or a fragment thereof which binds gp120, may be ligated into an expression plasmid which codes 
for an antibody wherein the variable region of the gene has been deleted. Methods for the preparation of 
genes which encode the heavy or light chain constant regions of immunoglobulins are taught, for example, 
by Robinson, R. et aL , PCT Application, Publication No. WO87-02671. The cDNA sequence encoding the 
CD4 molecule or fragment may be directly joined to the cDNA encoding the light or heavy Ig contant 
regions or may be joined via a linker sequence. Preferably, the linker sequence does not encode a protein 
product which gives rise to an antigenic reaction in the individual. 

Preferred immunoglobulin-like molecules which contain the CD4 molecules of the invention, or frag- 
ments thereof, contain the constant region of an IgM, lgG1 or lgG3 antibody. 

The CD4 proteins, glycoproteins, fragments, fusion proteins and immunoglobulin-like molecules, and 
pharmaceutical compositions thereof may be used for the treatment or prophylaxis of HIV viral infections. 
This method comprises administering to an animal an effective amount of the CD4 proteins, glycoproteins, 
fragments, fusion proteins and immunoglobulin-like molecules, and pharmaceutical compositions thereof, 
which are capable of specifically forming a complex with gp120 so as to render the HIV or SIV, with which 
the individual is infected, incapable of infecting T4* cells. 

The fusion protein and immunoglobulin-like molecule may complex to gp120 which is expressed on 
infected cells. Although the inventor is not bound by a particular theory, it appears that the Fc portion of the 
fusion protein or immunoglobulin-like molecule may bind with complement to mediate destruction of the 
cell. In this manner, infected cells are destroyed so that additional viral particle production is stopped. 

For the purpose of treating HIV infections, the non-human primate CD4 molecules or fragments thereof, 
fusion proteins or immunoglobulin-like molecules of the invention may additionally contain a radiolabel, 
therapeutic agent or cytotoxic polypeptide which enhances destruction of the HIV particle or HIV-infected 
cell. 

Examples of radioisotopes which can be bound to the proteins, glycoproteins, fusion proteins, and 
immunoglobulin-like molecules of the invention for use In HIV-therapy are 125 l, 131 I, 90 Y, 67 Cu, 217 Bi, 2 211 At, 
2i2pfc>, 47$c, and 109 Pd. Optionally, a label such as boron can be used which emits a and 0 particles upon 
bombardment with neutron radiation. 

For in vivo diagnosis radionucleotides may be bound to the CD4 proteins, glycoproteins or fragments 
thereof, "fusion proteins or immunoglobulin-like molecules either directly or by using an intermediary 
functional group. An intermediary group which is often used to bind radioisotopes, which exist as metallic 
cations, to antibodies is diethyienetriaminepentaacetic acid (DTPA). Typical examples of metallic cations 
which are bound in this manner are Mm Tc ,23 l. ni ln. 13l l, 97 Ru, e7 Cu, 67 Ga, and 6 *Ga. 

Moreover, the CD4 proteins and glycoproteins or fragments thereof, fusion proteins and 
immunoglobulin-like molecules may be tagged with an NMR Imaging agent which include paramagnetic 
atoms. The use of an NMR imaging agent allows the in vivo diagnosis of the presence of and the extent of 
HIV infection within a patient usingNMR techniques. Bements which are particularly useful in this manner 
are 157 Gd, 55 Mn. l62 Dy, 52 Cr, and s6 Fe. 

Introduction of the nucleic acid molecules of the invention by gene therapy may also be contemplated, 
for example, using retroviruses or other means to introduce the genetic material specifying the fusion 
proteins into suitable target tissues. In this embodiment, the target tissues having the nucleic acid 
molecules of the invention may then produce the CD4 molecules or fusion protein in vivo . 

The nucleic acid molecules specifying the CD4 molecules or fragments thereof may be used to 
reconstitute the immune system of an individual suffering from HIV. For example, the bone marrow cells of 
an HIV-infected individual may be removed and the hematopoietic stem cells, either as part of a mixed 
population or a purified fraction, may be infected or transfected with a virus or DNA construct that specifies 
the non-human primate CD4 or fragment thereof. Production of human CD4 may be shut down by including 
within the same or different genetic construct, a gene which interferes with the expression of human CD4. 
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Such a gene may take many forms, for example, it may encode RNA that binds to a regulatory protein 
(since the non-human primate CD4 may be under other control, its expression will not be affected); an 
antisense RNA that binds selectively to the human CD4 gene; or a DNA-blnding protein that has had its 
regulatory region amputated. The modified stem cells would then be injected back into the patient where 
they will migrate to the bone marrow. Preferably, the marrow would have been previously cleared of normal 
hematopoietic cells by irradiation or with a toxic drug. See Baltimore, D. Nature 335 :395-396 (1988). 

Methods for the transfection of hematopoietic cells are well known and taught for example, by 
Wetherall, D.J., Nature 331 :13-14 (1988); Dick, J.E., Ann. N. Y. Acad. Sci. 507 :242-251 (1987); Eglitis, D.B. 
et al ., Science 230 :1 395-1398 (1985); Gillio, A. et ah , Ann. I^Y. Acad. Sci. 511 :406-417 (1987). Methods 
for the transfection of cells with anti-sense RNA are taught, for example, by Hambor, J.E. et a! ., Proc. Natl. 
Acad. Sci. (USA) 85 :4010-4014 (1988); Sanford, J.C., J. Theor. Biol. 130 :469-480 (1988);Tzant, JiGTetaTT 
Science 229 :345-352 (1985); and Hambor, J.E. et aL , J. Exa. Med. 168 :1237-1245 (1988). 

The non-human primate CD4, and soluble and non-soluble fragments thereof which bind HIV or SIV 
gp120, may also be used in vivo to treat HIV infection by blocking infection of human CD4 bearing 
lymphocytes and syncytium formation. See Lui, M. et aJ J. Clin. Invest.82 :21 76-21 80 (1988) or Fischer, 
R.A. et al. , Nature 331 :76-78 (1988) for a discussion on the use of human CD4 and soluble fragments 
thereof to block HIV infection of CD4 bearing lymphocytes and syncytium formation. 

Fusion proteins comprising the CD4 proteins, glycoproteins and fragments thereof, and a therapeutic 
agent can also be used to treat HIV infected individuals by killing HIV-infected "cells in vivo . Therapeutic 
agents may include, for example, cytotoxic polypeptides such as the bacterial toxins" diphtheria toxin or 
ricin. Methods for producing fusion proteins comprising fragment A of diphtheria toxin are taught in U.S. 
Patent 4,675,382 (1987) which is incorporated by reference herein. Diphtheria toxin contains two polypep- 
tide chains. The B chain binds the toxin to a receptor on a cell surface. The A chain actually enters the 
cytoplasm and inhibits protein synthesis by inactivating elongation factor 2, the factor that translocates 
ribosomes along mRNA concomitant with hydrolysis of ATP. See Darnell, J., et al ., in Molecular Cell 
Biology ( Scientific American Books, Inc., page 662 (1986). Alternatively, a fusion protein comprising ricinTI 
toxic lectin, may be prepared. Methods for the preparation of a fusion protein comprising human CD4 linked 
to active regions of Pseudomonas endotoxin A and the use thereof to selectively kill HIV infected cells are 
taught by Chaudhary, V.K. et al. , Nature 335 :369-372 (1988), which is incorporated by reference herein. 

The dose ranges for the administration of the CD4 proteins, glycoproteins and fragments thereof, fusion 
proteins and immunoglobulin-like molecules are those which are large enough to produce the desired effect 
whereby the symptoms of HIV or SIV infection are ameliorated. The dosage should not be so large as to 
cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions, and the like. 
Generally, the dosage will vary with the age, condition, sex and extent of disease In the patient, counter 
indications, If any, immune tolerance and other such variables, to be adjusted by the individual physician. 
Dosage can vary from .001 mg/kg to 50 mg/kg, preferably 0.1 mg/kg to 1.0 mg/kg, of the CD4 molecule of 
the invention, gp120 binding molecule, or fragment thereof, fusion protein, or immunoglobulin-like molecule, 
in one or more administrations daily, for one or several days. The immunoglobulin-like molecule can be 
administered parenterally by injection or by gradual perfusion over time. They can be administered 
intravenously, intraperitoneally, intramuscularly, or subcutaneously. 

Preparations for parenteral administration include sterile or aqueous or non-aqueous solutions, suspen- 
sions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include 
water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Paren- 
teral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated 
Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishes, electrolyte replenishers, 
such as those based on Ringer's dextrose, and the like. Preservatives and other additives may also be 
present, such as, for example, antimicrobials, anti-oxidants, chelating agents, inert gases and the like. See, 
generally, Remington's Pharmaceutical Science , 16th Ed., Mack Eds., 1980. 

The invention also relates to a method for preparing a medicament or pharmaceutical composition 
comprising the components of the invention, the medicament being used for therapy of HIV or SIV infection 
in animals. 

The proteins and glycoproteins of the present invention may also be used in combination with other 
therapeutics used in the treatment of AIDS, ARC and HIV infection. For example, the proteins and 
glycoproteins may be co-administered with anti-retroviral agents that block reverse transcriptase such as 
AZT, DDI, HPA-23, phosphonoformate, suramin, ribavirin and deoxycytidine. Alternatively, the proteins and 
glycoproteins of the invention may be co-administered with such anti-viral agents as interferons, including 
alpha interferon, gamma interferon, omega interferon, or glucosidase inhibitors such as castanospermine. 
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Such combination therapies may advantageously utilize lower dosages of those agents so as to minimize 
toxicity and enhance the effectiveness of the treatment 

The detection and quantitation of antigenic substances and biological samples frequently utilizes 
immunoassay techniques. These techniques are based upon the formation of the complex between the 
6 antigenic substance, e.g., gp120, being assayed and an antibody or antibodies in which one or the other 
member of the complex may be detectably labeled. In the present invention, the CD4 proteins, 
glycoproteins or fragments thereof, immunoglobulin-like molecules or fusion proteins may be labeled with 
any conventional label. 

Thus, the CD4 protein, glycoprotein or fragment thereof, fusion protein or immunoglobulin-like molecule 
w can also be used in assay for HIV or SIV viral infection in a biological sample by contacting a sample, 
derived from an animal suspected of having an HIV or SIV infection, with the CD4 protein, glycoprotein or 
fragment thereof, fusion protein or immunoglobulin-like molecule, and detecting whether a complex with 
gp120, either alone or on the surface of an HIV-infected cell, has formed. 

For example, a biological sample may be treated with nitrocellulose, or other solid support which is 
75 capable of immobilizing cells, cell particles or soluble protein. The support may then be washed with 
suitable buffers followed by treatment with the CD4 protein, glycoprotein or fragment thereof, fusion protein, 
or immunoglobulin-like molecule, any of which may be detectably labeled. The solid phase support may 
then be washed with a buffer a second time to remove unbound protein and the label detected. 

In cartying~out the assay of the present invention on a sample containing gpi20, the process 
20 comprises: 

a) contacting a sample suspected of containing gp120 with a solid support to effect immobilization of 
gp120, or ceil which expresses gp120 on its surface; 

b) contacting said solid support with the detectably labeled CD4 protein, glycoprotein or fragment thereof 
which binds to HIV gp120 f immunoglobulin-like molecule or fusion protein molecule of the invention; 

25 c) incubating said detectably labeled molecule with said support for a sufficient amount of time to allow 
the detectably labelled molecule to bind to the immobilized gp120 or cell which expresses gp120 on its 
surface; 

d) separating the solid phase support from the incubation mixture obtained in step c): and 

e) detecting the bound detectably labeled molecule and thereby detecting and quantifying gp120. 

30 Alternatively, the detectably labeled CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like 
molecule or fusion protein - gp120 complex in a sample may be separated from a reaction mixture by 
contacting the complex with an immobilized antibody or protein which is specific for an immunoglobulin or, 
e.g., protein A, protein Q, anti-IgM or anti-IgG antibodies. Such antiimmunoglobulin antibodies may be 
monoclonal or polyclonal. The solid support may then be washed with suitable buffers to give an 

35 immobilized complex. The label may then be detected to give a measure of gpi20 and. thereby, the 
presence of HIV. 

This aspect of the Invention relates to a method for detecting HIV or SIV viral Infection in a sample 
comprising: (a) contacting a sample suspected of containing gp120 with a fusion protein comprising non- 
human primate CD4 or fragment thereof that binds to HIV gp120 and the Fc portion of an immunoglobulin 
40 chain, and 

(b) detecting whether a complex is formed. 

The invention also relates to a method of detecting gp120 in a sample, further comprising: 

(c) contacting the mixture obtained in step (a) with an Fc binding molecule, such as an antibody, protein 
A, or protein G. which is immobilized on a solid phase support and is specific for the fusion protein, to 

45 give a gpl20 fusion protein-immobilized antibody complex 

(d) washing the solid phase support obtained in step (c) to remove unbound fusion protein, and 

(e) and detecting the label on the fusion protein. 

Of course, the specific concentrations of detectably labeled immunoglobulin-like molecule (or fusion 
protein) and gp120, the temperature and time of incubation, as well as other assay conditions may be 

so varied, depending on various factors including the concentration of gpl20 in the sample, the nature of the 
sample, and the like. Those skilled in the art will be able to determine operative and optimal assay 
conditions for each determination by employing routine experimentation. 

Other such steps as washing, stirring, shaking, filtering and the like may be added to the assays as is 
necessary for the particular situation. 

55 One of the ways in which the CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like 
molecule or fusion protein can be detectably labeled is by linking the same to an enzyme. This enzyme, in 
turn, when later exposed to Its substrate, will catalize the formation of a product which can be detected as, 
for example, by spectrophotometric, fluorometric or by visual means. Enzymes which can be used to 
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detectably label the CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like molecule or fusion 
protein of the present invention include, but are not limited to, malate dehydrogenase, staphylococcal 
nuclease, deita-V-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, 
triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, 

s beta-galactosidase, ribonuclease, urease, catalase, glucose-VI-phosphate dehydrogenase, glucoamylase 
and acetylcholine esterase. 

The CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like molecule or fusion protein of 
the present invention may also be labeled with a radioactive isotope which can be determined by such 
means as the use of a gamma counter or a scintillation counter or by autoradiography. Isotopes which are 

w particularly useful for the purpose of the present invention are: 3 H, 12S I, ,31 l, 32 P, 3S S, 14 C. 51 Cr, 3B CI, S7 Co, 
58 Co, 59 Feand 75 Se. 

It is also possible to label the CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like 
molecule or fusion protein with a fluorescent compound. .When the fiuorescently labeled immunoglobulin- 
like molecule is exposed to light of the proper wave length, Its presence can then be detected due to the 
75 fluorescence of the dye. Among the most commonly used fluorescent labelling compounds are fluorescein 
isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o -phthaldehyde and 
fluorescamine. 

The CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like molecule or fusion prot ein of 
the invention can also be detectably labeled using fluorescence emitting metals such as tS2 Eu, or others of 

20 the lanthanide series. These metals can be attached to the CD4 protein, glycoprotein or fragment thereof, 
immunoglobulin-like molecule or fusion protein, using such metal chelating groups as 
diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA). 

The CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like molecule or fusion protein of 
the present invention also can be detectably labeled by coupling it to a chemi luminescent compound. The 

2S presence of the chemiluminescent-tagged CD4 protein, glycoprotein or fragment thereof, immunoglobulin- 
like molecule or fusion protein is then determined by detecting the presence of luminescence that arises 
during the course of a chemical reaction. Examples of particularly useful chemiluminescent labeling 
compounds are luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 
Likewise, a bioluminescent compound may be used to label the CD4 protein, glycoprotein or fragment 

30 thereof, immunoglobulin-like molecule or fusion protein of the present invention. Bioluminescence is a type 
of chemiluminescence found in biological systems in which a catalytic protein increases the efficiency of 
the chemiluminescent reaction. The presence of a bioluminescent protein is determined by detecting the 
presence of luminescence. Important biolum inescent compounds for purposes of labeling are luciferin, 
luciferase and aequorin. 

35 Detection of the CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like molecule or fusion 
protein may be accomplished by a scintillation counter, for example, if the detectable label is a radioactive 
gamma emitter, or by a fluorometer, for example, if the label is a fluorescent material. In the case of an 
enzyme label, the detection can be accomplished by colorimetric methods which employ a substrate for the 
enzyme. Detection may also be accomplished by visual comparison of the extent of enzymatic reaction of a 

40 substrate in comparison with similarly prepared standards. 

The assay of the present invention is ideally suited for the preparation of a kit. Such a kit may comprise 
a carrier means being compartmentalized to receive in close confinement therewith one or more container 
means such as vials, tubes and the like, each of said container means comprising the separate elements of 
the immunoassay. For example, there may be a container means containing a solid phase support, and 

45 further container means containing the detectably labeled CD4 protein, glycoprotein or fragment thereof, 
immunoglobulin-like molecule or fusion protein. Further container means may contain standard solutions 
comprising serial dilutions of analytes such as gp120 or fragments thereof to be detected. The standard 
solutions of these analytes may be used to prepare a standard curve with the concentration of gp120 
plotted on the abscissa and the detection signal on the ordinate. The results obtained from a sample 

so containing gp120 may be interpolated from such a plot to give the concentration of gp120. 

The CD4 protein, glycoprotein or fragment thereof, immunoglobulin-like molecule or fusion protein of 
the present invention can also be used as a stain for tissue sections. For example, a labeled molecule 
comprising CD4 protein or glycoprotein or HIV gp120 binding fragment thereof, may be contacted with a 
tissue section, e.g., a brain biopsy specimen. This section may then be washed and the label detected. 

55 The following examples are illustrative, but not limiting the method and composition of the present 
invention. Other suitable modifications and adaptations which are obvious to those skilled in the art are 
within the spirit and scope of this invention. 
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EXAMPLES 

5 EXAMPLE 1 ISOLATION OF CHIMPANZEE AND RHESUS MONKEY CD4 cDNAs 

cDNA clones encoding the CD4 antigens of the Chimpanzee ( Pan troglodytes ) and the Rhesus 
Monkey ( Maccaca mulatta ) were isolated, sequenced, and expressed. Non-human primate CD4 cDNAS 
were synthesized from the poly-A containing RNA of mitogen stimulated peripheral blood mononuclear cells 

jo obtained from these animals. cDNA expression libraries were made in the vector CDM8 and CD4 cDNAS 
we isolated by four rounds of immunoselection as previously described by Seed et al Proc. Natl. Acad. 
Sci (USA) 84 :3365-3369 (1987). Sequencing was carried out using the dideoxy nucleotide chain termination 
technique on single and double stranded templates. The DNA and amino acid sequences of the Chim- 
panzee and Rhesus Monkey CD4 are shown below. Also shown is a comparison of the respective 

75 sequences to human CD4. 

RHESUS CD4 CODING SEQUENCE AND PREDICTED AMINO ACID SEQUENCE SHOWING 
DIFFERENCE"SFROM HUMAN SEQUENCES 

20 



1 AT6AACCGGGGAATCCCTTTTAGGCACTT6CTTCTGGTGCTGCAACTGGCGCTACTCCCA 
25 -25 MetAsnArgGly II eProPheArgHi sleuLeuLeuVal LeuGl nLeuAl aLeuLeuPro 
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50 
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Val 

G C 

1 

6CAGTCACCCAG6SAAAGAAAGTGGTGCTGGGCAA6AAAGGGGATACA6TG6AACT6ACC 
AlaValThrGlnGlyLysLysValVaUeuGlyLysLysGlyAspThrValGluLeuThr 
Ala 

C T A 



1 2 1 TGTACAGCTTCGCAGAAGAAGAACACACAATTCCACTGGAAAAACTCCAACCAGATAAAG 
1 6 CysThrAl aSerGl nLysLysAsnThrGl nPheHi sTrpLy sAsnSerAsnGI n II eLys 

Serlle 



G T 



ATTCTGGGAATTCAGGGTCTCTTCTTAACTAAAGGTCCATCCAAGCTGAGCGATCGTGCT 
IleLeuGlyneGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAspArgAla 
Asn Ser Asn 
A CTC AT C 



24 1 GACTCAAGAAAAAGCCTTTGGGACCAAGGATGCTTTTCCATGATCATCAAGAATCTTAAG 
56 AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerMetllelleLysAsnLeuLys 
Arg Asn ProLeu 



G AA CC C 

* . 

ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGAACAAGAAGGAGGAGGTGGAATTG 
II eGl uAspSerAspThrTyrll eCysGl uVal Gl uAsnLysLysGI uGl uVal Gl uLeu 

AspGIn Gin 
G C C 



361 CTGGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTGAGGGGCAAAGCCTGACC 
96 LeuValPheGlyLeuThrAlaAsnSerAspThrHlsLeuLeuGluGlyGlnSerLeuThr 

Gin 
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CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGAAATGTAGGAGTCCAGGGGGT 480 
LeuThrLeuGl uSerProProGlySerSerProSerVal LysCysArgSerProGlyGly 135 

Gin Arg 
C A 



481 AAAAACATACAGGGGGGGAGGACCATCTCT6TGCCTCAGCTGGAGCGCCAGGATAGTGGC 
136 LysAsnneGlnGlyGlyArgThrlleSerValProGlnLeuGluArgGlnAspSerGly 

Lys Leu Ser Leu 

A C C T T 

ACCTGGACATGCACCGTCTCGCAGGACCAGAAGACGGTGGAGTTCAAAATAGACATCGTG 600 
ThrTrpThrCysThrVal SerGI nAspGI nLysThrVal Gl uPheLys I 1 eA$p 11 eVal 175 
Leu Asn Lys 

— T T— A A 

601 GTGCTAGCTTTCCAGAAGGCCTCCAGCACAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
176 Val LeuAl aPheGl nLysAl aSerSerThrValTyrLysLysGl uGlyGI uGl nVal Gl u 

He 



TTCTCCTTCCCACTCGCCTTTACACTTGAAAAGC7GACGGGCAGTGGCGAGCTGTGGTGG 720 
PheSerPheProLeuAlaPheThrLeuGluLysLeuThrGlySerGlyGluLeuTrpTrp 215 

Val 

G 



721 CAGGCGGAGAGGGCCTCCTCCTCCAAGTCTTGGATTACCTTCGACCTGAAGAACAAGGAA 
216 GlnAlaGluArgAlaSerSerSerLysSerTrpIleThrPheAspLeuLysAsnLysGlu 



GTGTCTGTAAAACGGGTTACCCAGGACCCCAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 255 



841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACGCTGGCC 
256 HI $LeuTk.'LeuProGl nAl aLeuProGl nTyrAl aGlySerGlyAsnLeuThrLeuAl a 
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CTTGAA6CGAAAACAGGAAAGTTGCATCAG6AA6TGAACCTCGTGGT6AT6AGA6CCACT 960 
LeuGl uAl aLysThrGlyLysLeuHl sGl nG1 uVal AsnLeuVal ValMetArgAl aThr 295 



961 CAGTTCCAGGAAAATTTGACCTGTGAAGTGTGGGGACCCACCTCCCCTAAGCTGACGCTG 
296 61 nPheGI nGI uAsnLeuThrCysGI uValTrpGlyProThrSerProLysLeuThrLeu 
Leu Lys Met 



AGCTTGAAACTGGAGAACAAGGGGGCAACGGTCTCGAAGCAGGCGAAGGCGGTGTGGGTG 1 080 
SerLeuLysLeuGluAsnLysGlyAlaThrValSerLysGlnAlaLysAlaValTrpVal 335 

Glu Lys ArgGlu 
A A -G- A 

1 08 1 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTA 
336 LeuAsnProGI uAl aGlyHetTrpGl nCysLeuLeuSerAspSerGlyGI nVal LeuLeu 



GAATCCAACATCAAGGTTGTGCCCACATGGCCCACCCCGGTGCAGCCAATGGCCCTGATT 1 200 
GluSerAsnlleLysValValProThrTrpProThrProValGlnProHetAlaLeune 375 

Leu Ser 

C T 



1201 GTGCTGGGGGGCGTTGCGGGCCTCC7GCTTTTCACTGGGCTAGGCATCTTCTTCTGTGTC 
376 VaHeuGlyGlyValAlaGlyLeuLeuLeuPheThrGlyLeuGlyllePhePheCysVal 
lie 

C C T 



AGGTGCCGGCATCGAAGGCGTCAAGCAGAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCysArgHi sArgArgArgGI nAl aGl uArgMetSerGl n 1 1 eLysArgLeuLeuSer 415 



1321 GAAAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATTTGA 1377 
416 GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProIleEnd 433 



CHIMP CD4 CODING SEQUENCE AND PREDICTED AMINO ACID SEQUENCE SHOWING 
DIFFERENCES FROM HUMAN SEQUENCES 
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1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACT66CACTCCTCCCA 
-25 MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

G 

1 

GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 
Al aAl aThrGl nGlylysLysVal Val LeuGlyLysLysGlyAspThrVal G1 uLeuThr 

A T 



^ AAAAAAAAA 

121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGACAAAG 
1 6 CysThrAl aSerGI nLy sLysSer II eGl nPheM sTrpLysAsnSerAsnGl nThrLys 



T 



AnCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGTT 
IleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 

Ala 
C 



AAAAAAAAA # . 

24 1 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTACCCTGATCATCAAGAATCT7AAG 
56 AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeuIlelleLysAsnLeuLys 

CC 
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10 



75 



ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 360 
IleGI uAspSerAspThrTyrll eCysGl uVal GlyAspGl nLysGl uGl uVal GlnLeu 95 

61 u 
A 

361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACC 
96 LeuValPheGlyLeuThrAUAsnSerAspThrHlsLeuLeuGlnGlyGlnSerLeuThr 

CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGT 360 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 135 

481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 
136 LysAsnlleGlnGlyGlyLysThrLeuSerValSerGlnLeuGluLeuGlnAspSerGly 

.* • . . . . . 

»> ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAAGTGGAGTTCAAAATAGACATCGTG 600 

ThrTrpThrCysThrVal LeuGI nAsnGI nLysLys Val G1 uPheLys II eAsp II eVal 175 



601 GTGCTAGCTTTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
1 76 Val LeuAl aPheGI nLysAI aSerSer II eVal TyrLysLysGl uGlyGI uGl nVal G1 u 

so TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 

PheSerPheProLeuAl aPheThrVal G1 uLysLeuThrGlySerGlyGl uLeuTrpTrp 215 

721 CAGGCGGAGAGGGCnCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAA 
3 s 216 Gl nAl aGl uArgAl aSerSerSerLysSerTrpIleThrPheAspLeuLysAsnLysGl u 

GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
Val SerVal LysArgVal ThrGl nAspProLysLeuGI nNetGlyLysLysLeuProLeu 255 

40 



45 



841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 
256 H1sLeuThrLeuProG1nA1aLeuProGlnTyrA1aG1ySerG1yAsnLeuThrLeuA1a 



CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 840 
LeuGluAlaLysThrGlyLysLeuHlsGlnGluValAsnUuValValHetArgAlaThr 295 

G 



50 
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961 CAGCTCCAGAAAAATTT6ACCTGTGAG6T6T6GGGACCCACCTCCCCTAAGCTGATGCT6 
296 GInLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 

AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 1080 
SerleulysLeuGl uAsnLysGI uAl aLy sVal SerLysArgGI uLy sAI aVal TrpVal 335 



1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 
336 LeuAsnProGl uAl aGlyMetTrpGl nCysleuleuSerAspSerGlyGl nVal LeuLeu 



GAATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCTGATT 1200 
Gl uSerAsn II eLy s Val LeuProThrTrpSerThrProVal Gl nProMetAl aLeu He 375 



1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCTTCTTCTGTGTC 
376 ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 



AGGTGCCGGCACCGAAGGCGCCAAGCACAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCysArgHlsArgArgArgGlnAlaGlnArgMetSerGlnlleLysArgLeuLeuSer 415 

Glu 



1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATTTGA 1377 
416 GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProIleEnd 433 

The chimpanzee CD4 antigen is 99% homologous to its human counterpart, possessing 5 amino acid 
substitutions in the 433 amino acid predicted mature polypeptide, while the rhesus monkey CD4 is 92% 
homologous having 34 divergences from the human CD4 amino acid sequence. Antigen expression was 
effected transiently in CDM8 as well as stably using the retroviral vector pMNCS. 



EXAMPLE 2 CHARACTERIZATION OF THE HUMAN CD4 DOMAIN WHICH IS REQUIRED FOR 
HIV MEDIATED SYNCYTIUM FORMATION 



These non-human primate CD4 antigens were expressed on human cells which were thereby rendered 
susceptible to infection by HIV, but formed strikingly fewer multinucleated giant cells, or syncytia, than their 
counterparts expressing the human CD4 antigen. Using in vitro mutagenesis this phenotype was localized to 
a single amino acid difference between the chimpanzee and human CD4 glycoproteins. This amino acid 
substitution quantitatively affects the ability of HeLa cells to form syncytia when these antigens are 
expressed in concert with the external and trans membrane proteins (EMP and TMP) of the human 
immunodeficiency virus type I (HIV). This was achieved by transiently expressing six trans-species hybrid 
CD4 antigens, which contain each of the three nonconservative extracellular amino acid sequence changes 
between the two species alone and in pairs, followed by infection with the Vaccinia:(HIV env ) recombinant 
55 virus VSC25. The presence of a glycine residue at position 87, as found in chimpanzee CD4 t instead of the 
glutamic acid residue found in human CD4, essentially eliminates the formation of multinucleated syncytia. 
Conversely the transfer of the human glutamic acid residue at position 87 to the chimpanzee CD4 confers 
the ability to form syncytia in the presence of HIV EMP and TMP. In contrast the absence or presence of 
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either of the two amino acid substitutions which create glycosylate sites unique to the chimpanzee CD4 
antigen, at amino acids 34 and 68 in the first immunoglobulin variable region homologous domain, has little 
or no effect on the extent of syncytium formed in this assay. We expect that all of these hybrid CD4 
glycoproteins will show equal affinity for HIV EMP, since none of these amino acid sequence differences 
are in the HIV binding site defined earlier. 

If syncytium formation is an important mechanism of HIV induced, disease this blockade of HIV 
mediated syncytium formation may account for the resistance of the chimpanzee to the pathology of the 
acquired immune deficiency syndrome (AIDS) despite prolonged infection by HIV. 



EXAMPLE 3 PREPARATION OF CD4-IQ cDNA CONSTRUCTS 



The Extracellular portion of the chimpanzee or rhesus monkey coding sequence (encoding the signal 
peptides and amino acids 1-372 of the mature glycoproteins) is fused at three locations to a human lgG1 
heavy chain constant region gene by means of a synthetic splice donor linker molecule. To exploit the 
splice donor linker, a BaMHI linker having the sequence CGCGGATCCGCG is first inserted at amino acid 
residue 395 of the CD4 precursor sequence (nucleotide residue 1295). A synthetic splice donor sequence 

6ATCCCGAGGGTGAGTACTA 
GGCTCCCACTCATGATTCGA 

bounded by BamHI and Hindlll complementary ends is created and fused to the Hindlll site in the intron 
preceding the CH1 domain, to the Espl site in the intron preceding the hinge domain, and to the Bani site 
preceding the CH2 domain of the lgG1 genomic sequence. Assembly of the chimeric genes by ligation at 
the BaMHI site affords molecules in which either the variable (V) region, the V+CH1 regions, or the V. CH1 
and hinge regions are replaced by CD4. In the last case, the chimeric molecule is expected to form a 
monomer structure, while in the former, a dimeric molecule is expected. , 

Immunoprecipitation of the fusion proteins with a panel of monoclonal antibodies directed against CD4 
epitopes will show that all of the epitopes are preserved. A specific high affinity association is demonstrated 
between the chimeric molecules and HIV envelope proteins expressed on the surface of ceils transfected 
with an attenuated (reverse transcriptase deleted) proviral construct, or infected with a vaccinia:HIV env 
recombinant virus. 



EXAMPLE 4 PREPARATION OF THE FUSION PROTEINS FROM SUPERNATANTS OF COS CELLS 

COS cells grown in DME medium supplemented with 10% Calf Serum and gentamicin sulfate at 15 
U,g/ml are split into DME medium containing 10% NuSerum (Collaborative Research) and gentamicin to 
give 50% confluence the day before transfection. The next day, CsCI purified plasmid DNA is added to a 
final concentration of 0.1 to 2.0 ug/ml followed by DEAE Dextran to 400 ug/ml and chloroquine to 100 uM. 
After 4 hours at 37* C, the medium is aspirated and a 10% solution of dimethyl sulfoxide in phosphate 
buffered saline is added for 2 minutes, aspirated, and replaced with DME/10% Calf Serum. 8 to 24 hours 
later, the cells are trypsinized and split 1 :2. 

For radiolabeling, the medium is aspirated 40 to 48 hours after transfection, the cells are washed once 
with phosphate buffered saline, and DME medium lacking cysteine or methionine is added. 30 minutes 
later. 35 S-labeled cysteine and methionine are added to final concentrations of 30-60 aci and 100-200 uci 
respectively, and the ceils allowed to incorporate label for 8 to 24 more hours. The supernatants are 
recovered and examined by electrophoresis on 7.5% polyacrylamide gels following denaturation and 
reduction, or on 5% polyacrylamide following denaturation without reduction. The lgG-CD4 fusion proteins 
form dimer structures. The CD4-lgM fusion proteins form large multimers beyond the resolution of the gel 
system without reduction, and monomers of the expected molecular mass with reduction. 

Unlabeled proteins are prepared by allowing the cells to grow for 5 to 10 days post transfection in DME 
medium containing 5% NuSerum and gentamicin as above. The supernatants are harvested, centrifuged. 
and purified by batch adsorption to either protein A trisacryl, protein A agarose, goat anti-human IgG 
antibody agarose, rabbit anti-human IgM antibody agarose, or monoclonal anti-CD4 antibody agarose. 
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Antibody agarose conjugates are prepared by coupling purified antibodies to cyanogen bromide activated 
agarose according to the manufacturers recommendations, and using an antibody concentration of 1 
mg/ml. Following batch adsorption by shaking overnight on a rotary table, the beads are harvested by 
pouring into a sintered glass funnel and washed a few times on the funnel with phosphate buffered saline 
s containing 1% Nonidet P40 detergent. The beads are removed from the funnel and poured into a small 
disposable plastic column (Quik-Sep QS-Q column, Isolab), washed with at least 20 column volumes of 
phosphate buffered saline containing 1% Nonidet P40, with 5 volumes of 0.15 M NaCI, 1 mM EDTA (pH 
8.0), and eluted by the addition of either 0.1 M acetic acid, 0.1 M acetic acid containing 0.1 M NaCI, or 0.25 
M glycine-HCI buffer, pH 2.5. 

EXAMPLE 5 BLOCKAGE OF SYNCYTIUM FORMATION BY THE FUSION PROTEINS 

Purified or partially purified fusion proteins are added to HPB-ALL cells infected 12 hours previously 
is with a vaccinia virus recombinant encoding HIV envelope protein. After incubation for 6-8 more hours, the 
cells are washed with phosphate buffered saline, fixed with formaldehyde, and photographed. All of the full- 
length CD4 immunoglobulin fusion proteins will show inhibition of syncytium formation. 

Having now fully described this invention, it will be appreciated by those skilled in the art that the same 
can be performed with any wide range of equivalent parameters of composition, conditions, and methods of 
20 preparing such recombinant molecules, vectors, transformed hosts and proteins without departing from the 
spirit of scope of the invention or any embodiment thereof. 

Claims 

25 

1. A nucleic acid molecule specifying non-human primate CD4, or an HIV gp120 binding fragment thereof, 
which preferably is soluble in aqueous solution. 

2. The nucleic acid molecule of claim 1 which is DNA, RNA or is complementary to the nucleic acid 
molecule of claim 1. 

30 3. The nucleic acid molecule of claim 1 which is detectably labeled. 

4. The nucleic acid molecule of claim 1, wherein said non-human primate is the rhesus monkey and said 
molecule comprises the following ONA sequence: 
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1 ATGAACCGGGGAATCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTACTCCCA 
-25 MetAsnArgGlyIleProPheArgEiaLeuLetiI*euValL«uGlnI*euAlaLeuLeuPro 

GCAGTCACCCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGATACAGTGGAACTGACC 120 
AlaValThrGlnGlyLysLysValValLeuGlyLysLyaGlyAspThrValGluLeuThr 15 

70 •••••• 

121 TGTACAGCTTCGCAGAAGAAGAA»CACAATTCCACTGGAAAAACTCa^CCAGATAAAG 
16 CysThrAlaSerGlnLyaLyaAsnThrGlnPheHiaTrpLysAsnSexAanGlnlleLys 

75 

ATTCTGGGAATTCAGGGTCTCTTCTTAACTAAAGGTCCATCCAAGCTGAGCGATCGTGCT 240 
IleLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAapArgAla 55 

20 •••••• 

241 GACTCAAGAAAAAGCCTTTGGGACCAAGGATGCTTTTCCATGATCATCAAGAATCTTAAG 
56 AapSerArgLyaSerLeuTxpAapGlnGlyCyaPheSecMetIleIleLyaAsnLeuI*ys 

25 

ATAGAAGACTCAG^TACTTACATCTGTGAAGTGGAGAACAAGAAGGAGGAGGTGGAATTG 360 
IleGluAapSerAapThrTyrlleCysGluValGluAanLysLyaGluGluValGluLeu 95 

30 

361 CTGGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTGAGGGGCAAAGCCTGACC 
96 LeuValPheGlyLeuThrAlaAanSorAapThxHiaLeuLeuGluGlyGlnSerLeuThr 

35 



40 



45 , 
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10 



75 



20 



25 



30 



35 



40 



CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGAAATGTAGGA6TCCAGGGGGT 480 
LeuThrLeuGluSerProProGlySerSerProSerValLysCyaArgSorProGlyGly X35 



481 AAAAACATACAGGGGGGGAGGACCATCTCTGTGCCTCAGCTGGAGCGCCAGGATAGTGGC 
136 LysAsnlleGlnGlyGlyArgThrlleSerValProGlnLeuGluArgGlnAspSerGly 



ACCTGGACATGCACCGTCTCGCAGGACCAGAAGACGGTGGAGTTCAAAATAGACATCGTG 600 
ThrTrpThrCysThrValSerGlnAapGlnLysThrValGluPheLyalleAapIleVal 175 



601 GTGCTAGCTTTCCAGAAGGCCTCCAGCACAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
176 ValLeuAlaPheGlnLyoAlaSerSerThrValTyrLyeLysGlwGlyGluGlnValGlu 



TTCTCCTTCCCACTCGCCTTTACACTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 
PheSerPheProLeuAlaPheThrLeuGluLysLeuThrGlySerGlyGluLeuTrpTrp 215 



721 CAGGCGGAGAGGGCCTCCTCCTCCAAGTCTTGGATTACCTTCGACCTGAAGAACAAGGAA 
216 GlnAlaGluArgAlaSerSerSerLyaSerTrpIleThrPheAapLeuLyaAanLyaGlu 



GTGTCTGTAAAACGGGTTACCCAGGACCCCAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 
ValSexValLyaArgValThrGlnAapProLyaLeuGlnMetGlyLyaliysLeuProLeu 255 



841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACGCTGGCC 
256 HisLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAanLeuThrl^UAla 



CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 960 
LeuGluAlaLyaThrGlyLyaL«uBiaGlnGluValAanLeuValValMatArgAlaThr 295 



45 
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9 61 CAGTTCCAGGAAAATTTGACCTGTGAAGTGTGGGGACCCACCTCCCCTAAGCTGACGCTG 
296 GlnPheGlnGluAanLeuThrCyaGluValTrpGlyPrpThrSerProLyaLeuThrLeu 

• • • • • • 

AGCTTGAAACTGGAGAACAAGGGGGCAACGGTCTCGAAGCAGGCGAAGGCGGTGTGGGTG 1080 

SerLeuLysLeuGluAanLysGlyAlaThrValSerLyaGlnAlaLyaAlaValTrpVal 335 

. . « • • • 

1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTA 
336 LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAapSerGlyGlnValLeuLeu 

• • • • • • 
GAATCCAACATCAAGGTTGTGCCCACATGGCCCACCCCGGTGCAGCCAATGGCCCTGATT 1200 

_GluScrAsnIleLyaValValProThrTrpProThxProValGlnProMetAlaLeuIle 375 

• • • • • 

1201 GTGCTGGGGGGCGTTGCGGGCCTCCTGCTTTTCACTGGGCTAGGCATCTTCTTCTGTGTC 
376 ValLeuGlyGlyValAlaGlyLeuLeuLeuPheThrGlyLeuGlyllePhePheCyaVal 

• • • • * * 
AGGTGCCGGCATCGAAGGCGTCAAGCAGAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCyaArgBiaArgArgArgGlnAlaGluArgMetSerGlnlleLyaArgLeuLeuSer 415 

• • « • • 
1321 GAAAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATT 

416 GluLyaLyaThrCyaGlnCysProHiaArgPheGlnLyaThrCyaSerProIle; 

or a degenerate variant thereof, or wherein said non-human primate is the rhesus monkey and the said 
nucleic acid fragment comprises the following DNA sequence: 

• • • • • • 
1 ATGAACCGGGGAATCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTACTCCCA 

-25 MetAanArgGlylleProPheArgHiaLeuLeuLeuValLeuGlnXeuAlaLeuLeuPro 

• • • • • • 
GCAGTCACCCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGATACAGTGGAACTGACC 120 
AlaValThrGlnGlyLysLyaValValLeuGlyLysLyaGlyAapThrValGluLeuThr 15 
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121 TGTACAGCTTCGCAGAAGAAGAACACACAATTCCACTGGAAAAACTCCAACCAGATAAAG 
16 CyaThrAlaSerGlnLysLyoAsnThrGlnPheHiaTrpLysAanSerAanGlnlleLys 

• • « • • • 

ATTCTGGGAATTCAGGGTCTCTTCTTAACTAAAGGTCCATCCAAGCTGAGCGATCGTGCT 240 
IleLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLyaLeuSerAapArgAla 55 



241 GACTCAAGAAAAAGCCTTTGGGACCAAGGATGCTTTTCCATGATCATCAAGAATCTTAAG 
56 AapSerArgLyaSerLeuTrpAapGlnGlyCy«PheSerMetIleIleLy3AanI*euI.ya 

• • • • • • • 

ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGAACAAGAAGGAGGAGGTGGAATTG 360 
IleGluAapSerAapThrTyrlleCyaGluValGluAanLysLyaGluGluValGluI^u . „ 95 

• • • • 

361 CTGGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTT 
96 LeuValPheGlyLeuThrAlaAanSerAapThrHiaLeuLeu ; 

or a degenerate variant thereof, or wherein said non-human primate is the chimpanzee and said molecule 
comprises the following DNA sequence: 

• • • • • • 

1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
-25 MetAanArgGlyValProPheArgHiaLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

• • • * • * 
GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 120 
AlaAlaThrGlnGlyLyaLyaValValLeuGlyLyaLyaGlyAapThrValGluLeuThr 15 

• • • • • • 
121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGACAAAG 

16 CyaThrAlaSerGlnLysLyaSerlleGlnPheHieTrpLyaAanSerAanGlnThxLya 

• • • • • • 

ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGTT 240 
IleLeuGlyAanGlnGlySerPheLeuThrLyaGlyProSerLyaLeiaAanAapArgVal 55 

• • * • • * 
241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTACCCTGATCATCAAGAATCTTAAG 

56 AapSerArgArgSerLeuTrpAapGlnGlyAanPheThrLeuIlelleLyaAanLeuLya 
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ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 
IleGluAapSerAspThrTyrlleCysGluValGlyAapGlnLyeGluGluValGlnLeu 

• • • • • • 

361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACC 
96 LeuValPheGlyLeuThrAlaAsnSerAapThxHiaLeuLeuGlnGlyGlnSerLeuThr 

• • • * • • 
CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGGT 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCyoArgSerProAxgGly 

• • • • ■ • 

481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 
136 LysAsnlleGlnGlyGlyLysThrLeuSerValSerGlnLeuGluLeuGlnAspSerGly 

• • • • • * 

ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAAGTGGAGTTCAAAATAGACATCGTG 
ThrTrpThrCyaThrValLeuGlnAsnGlnLysLyaValGluPheLyoIleAapIleVal 

• * • • • • 
601 GTGCTAGCTTTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 
176 ValLeuAIaPheGlnLyaAlaSerSerlleValTyrLysLyaGluGlyGluGlnValGlu 

• • • * • • 
TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 
PheSerPheProLeuAlaPheThrValGluLysLeuThrGlySerGlyGluLeuTrpTrp 

• • • « • • 
721 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAA 
216 GlnAlaGluArgAlaSerSerSerLysSerTxpIleThrPheAapLeuLysAanLysGlu 

• • ■ • • * • 
GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 
ValSerValLyaArgValThrGlnAspProLyaLeuGlnMetGlyLyaLyaLeuFroLeu 

• ••>*• 

841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCT6GAAACCTCACCCTGGCC 
256 HiaLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAanLeuThrLeuAla 

• • • • ■ • 
CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 
LeuGluAlaLysThrGlyLyaLeuBiaGlnGluValAanLeisValValMetArgAlaThr 
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9 61 CAGCTCCAGAAAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 
296 GlnLeuGlnLyaAanL«uThrCy5GluValTrpGlyProThrSerProLy»LeuMetL«u 

• • • • • • 
AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 1080 

SerLeuLyaLeuGluAsnLysGluAlaLysValSerLysArgGluLysAlaValTrpVal 335 

• . • • • * 
1081 CI GAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 

336 LeuAanProGluAlaGl^etTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 



GAATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCTGATT 1200 
GluSorAsnlleLyaValLouProThrTrpSarThrProValGlnProMetAlaLeuIla 375 

• • « • • • 
1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCTTCTTCTGTGTC 

376 valLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCyaVal 

• • • • • • 
AGGTGCCGGCACCGAAGGCGCCAAGCACAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 
ArgCysAxgHiaArgArgAxgGlnAlaGlnArgMetSerGlnlleLyaArgLeuLeuSor 415 

• * • * • 
1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATT 

416 GluLyaLyaThrCyaGlnCyaProHiaArgPheGlnLyaThrCyaSerProIla ; 

or a degenerate variant thereof, or wherein said non-human primate is the chimpanzee and 
said nucleic acid fragment comprises the following DNA sequence: 

• • • • • - 

1 ATGAACCGGGGAGTCCCTXTTAGGCACTTGCTTCTGGTGCTGCAACTGGCACTCCTCCCA 
-25 MetAanArgGlyValProPheArgEieLeuLauLouValLeuGlnLeuAlaLeuLeuPro 

• • • • • * 
GCAGCCACTCAGGGAAAGAAAGXGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 120 
AlaAlaThrGlnGlyLysLyaValValLeuGlyLyaliyaGlyAapThrValGluLeuTbr 15 
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121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGACAAAG 
16 CysThrAlaSerGlnLysLysSerlleGlnPheBiaTrpLydAflnSerAanGlnThrLys 

ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGTT 
IleLeuGlyAsnGlnGlySerPheLeuThrLyaGlyProSerLysLeuAanAspAxgVal 

• ••••• 
241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTACCCTGATCATCAAGAATCTTAAG 

5 6 AapSerArgArgSerLeuTrpAspGlnGlyAanPheThrLeuIlelleLyaAsnLeuLys 

• • * • • • 
ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 
IleGluAapSerAapThrTyrlleCysGluValGlyAspGlnLyoGluGluValGlnLeu 

• • • • 

3 61 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTT 
96 LeuValFheGlyLeuThrAlaAanScxAspThrBisLeuLeu ; 

or a degenerate variant thereof. 

5. A recombinant DNA molecule comprising the following sequence: 

• • * • • * 

1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTC7GGTGCTGCAACTGGCACTCCTCCCA 

• • * • • • 
GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAGAAAGGGGACACAGTGGAACTGACC 

■ • * • • • 

121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGAyAAAG 

• • • • • • 
ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGYT 

241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTMCCCTGATCATCAAGAATCTTAAG 

*■•••• 
ATAGAAGACTCAGATACTTACATCTGTGAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 
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361 CTAGTGTTCGGATT6ACTGCCAACTCTGACACCCACCTGCTTCAGGGGCAGAGCCTGACC 



CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGrrCCAAGGGGT 480 



481 AAAAACATACAGGGGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 



ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAAGTGGAGTTCAAAATAGACATCGTG 600 



601 GTGCTAGCTOTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 



TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 



721 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAk 



GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 



841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 



CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTCGTGGTGATGAGAGCCACT 960 



961 CAGCTCCAGAAAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 



AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGA6AAGGCGGTGTGGGTG 1080 



1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 



GAATCCAACATCAAGGXTCTGCCCACATGG7CCACCCCGG7GCAGCCAATGGCCCTGATT 1200 
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1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCTTCTTCTGTGTC 



AGGTGCCGGCACCGAAGGCGCCAAGCASAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 



1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATT ; 

wherein Y is C or T, 
M is A or C, and 
S is C or G; 

or a degenerate variant thereof. 

6. A nucleic acid molecule specifying glycosylated human CD4 with the cytoplasmic domain, comprising 
the following DNA sequence: 

«•»••• 

1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCA 



GCAGCCAC7CAGGGAAAGAAAGTGGTGCTGGGCAAAAAAGGGGATACAGTGGAACTGACC 120 



121 TGTACAGC7TCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGAYAAAG 



ATTCTGGGAAATCAGGGC7CCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGCT 240 



241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTMCCCTGATCATCAAGAATCTTAAG 



ATAGAAGACTCAGATAC7TACATCTG7GAAGTGGGGGACCAGAAGGAGGAGGTGCAATTG 360 



361 CTAG7G7TCGGATTGACTGCCAAC7CTGACACCCACCTGCTTCAGGGGCAGAGCCTGACC 



CTGACCTTGGAGAGCCCCCCTGGTAGTAGCCCCTCAGTGCAATGTAGGAGTCCAAGGGG7 480 
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481 AAAAACATACAGGCGGGGAAGACCCTCTCCGTGTCTCAGCTGGAGCTCCAGGATAGTGGC 

• ••••• 
ACCTGGACATGCACTGTCTTGCAGAACCAGAAGAAGGTGGAGTTCAAAATAGACATCGTG 600 

• • • • • • 

601 GTGCTAGCTTTCCAGAAGGCCTCCAGCATAGTCTATAAGAAAGAGGGGGAACAGGTGGAG 

• ••••• 

TTCTCCTTCCCACTCGCCTTTACAGTTGAAAAGCTGACGGGCAGTGGCGAGCTGTGGTGG 720 

721 CAGGCGGAGAGGGCTTCCTCCTCCAAGTCTTGGATCACCTTTGACCTGAAGAACAAGGAA 

GTGTCTGTAAAACGGGTTACCCAGGACCCTAAGCTCCAGATGGGCAAGAAGCTCCCGCTC 840 

• ••••• 

841 CACCTCACCCTGCCCCAGGCCTTGCCTCAGTATGCTGGCTCTGGAAACCTCACCCTGGCC 

CTTGAAGCGAAAACAGGAAAGTTGCATCAGGAAGTGAACCTGGTGGTGATGAGAGCCACT 9 60 

.«•••• 

961 CAGCTCCAGAAAAATTTGACCTGTGAGGTGTGGGGACCCACCTCCCCTAAGCTGATGCTG 

• ••••• 

AGCTTGAAACTGGAGAACAAGGAGGCAAAGGTCTCGAAGCGGGAGAAGGCGGTGTGGGTG 1080 

• • • • • • 
1081 CTGAACCCTGAGGCGGGGATGTGGCAGTGTCTGCTGAGTGACTCGGGACAGGTCCTGCTG 

• ••••• 
GAATCCAACATCAAGGTTCTGCCCACATGGTCCACCCCGGTGCAGCCAATGGCCCTGATT 1200 

• ••••• 
1201 GTGCTGGGGGGCGTCGCCGGCCTCCTGCTTTTCATTGGGCTAGGCATCTTCTTCTGTGTC 

• • • • • • 
AGGTGCCGGCACCGAAGGCGCCAAGCAGAGCGGATGTCTCAGATCAAGAGACTCCTCAGT 1320 

• « • • • 

1321 GAGAAGAAGACCTGCCAGTGCCCTCACCGGTTTCAGAAGACATGTAGCCCCATTTGA 1377 

wherein Y is C or T. and 
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10 



15 



25 



30 



M Is A or C; 

or a degenerate variant thereof; 

with the proviso that both Y is not T and M is not C at the same time. 

7. A nucleic acid molecule specifying a glycosylated human CD4 fragment, comprising the following DNA 
sequence: 



1 ATGAACCGGGGAGTCCCTTTTAGGCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCA 



GCAGCCACTCAGGGAAAGAAAGTGGTGCTGGGCAAAAAAGGGGATACAGTGGAACTGACC 120 



121 TGTACAGCTTCCCAGAAGAAGAGCATACAATTCCACTGGAAAAACTCCAACCAGAYAAAG 



20 ATTCTGGGAAATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAAGCTGAATGATCGCGCT 240 



241 GACTCAAGAAGAAGCCTTTGGGACCAAGGAAACTTTMCCCTGATCATCAAGAATCTTAAG 



ATAGAAGACTCAGATACTTACATCTGTGAAGTGGAGGACCAGAAGGAGGAGG7GCAATTG 360 



361 CTAGTGTTCGGATTGACTGCCAACTCTGACACCCACCTGCTT 



wherein Y is C or T, and 
35 M is A or C; 

or a degenerate variant thereof; 

with the proviso that both Y is not T and M is not C at the same time. 

8. A nucleic acid molecule specifying a fusion protein, comprising: 

1) the nucleic acid molecule of claim 1, linked to 

40 2) a nucleic acid molecule specifying an immunoglobulin heavy chain, preferably of the class IgM, lgG1 or 
lgG3. 

wherein the nucleic acid sequence which specifies the variable region of said immunoglobulin heavy chain 
has been replaced with said nucleic acid molecule specifying said fragment. 

9. A nucleic acid molecule specifying a fusion protein, comprising: 

45 1) a nucleic acid molecule specifying a non-human primate CD4, or HIV or SIV gp120 binding fragment 
thereof, linked to 

2) a nucleic acid molecule specifying an immunoglobulin light chain, preferably of the class IgM. IgGl or 
lgG3, 

wherein the nucleic acid sequence which specifies the variable region of said immunoglobulin light chain 
so has been replaced with said nucleic acid molecule specifying said fragment. 

10. A nucleic acid molecule specifying a fusion protein, comprising: 

1) a nucleic acid molecule specifying a non-human primate CD4, or HIV or SIV gp120 binding fragment 
thereof, linked to 

2) a nucleic acid molecule specifying a cytotoxic polypeptide. 

55 11. A vector comprising the nucleic acid molecule of any one of claims 1 or 4 to 10. 

12. A host transformed with the vector of claim 11, especially a host transformed with a vector comprising 
the nucleic acid molecule of claim 8, wherein said host expresses an immunoglobulin light chain together 
with the expression product of nucleic acid molecule to give an immunoglobulin-like molecule which binds 
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to HIV or SIV gp120, or a host transformed with a vector comprising the nucleic acid molecule of claim 9, 
wherein said host expresses an immunoglobulin heavy chain together with the expression product of nucleic 
acid molecule to give an immunoglobulin-like molecule which binds to HIV or SIV gp120, and wherein said 
immunoglobulin heavy chain is preferably of the immunoglobulin class IgM, lgG1 or lgG3. 

13. A method of producing non-human primate CD4, or fragment thereof which binds to HIV or SIV gp120, 
which comprises 

cultivating in a nutrient medium under protein-producing conditions, a host strain transformed with a vector 
comprising the nucleic acid molecule of claim 1, said vector further comprising expression signals which 
are recognized by said host strain and direct expression of said non-human primate CD4, and recovering 
the non-human primate CD4 so produced. 

14. A method of producing a fusion protein comprising non-human primate CD4, or fragment thereof which 
binds to gp120, and an immunoglobulin heavy chain, wherein the variable region of the immunoglobulin 
chain has been substituted with non-human primate CD4, or fragment thereof which binds to HIV or SIV 
gp120, which comprises 

cultivating in a nutrient medium under protein-producing conditions, a host strain transformed with a vector 
comprising the nucleic acid molecule of claim 8, said vector further comprising expression signals which 
are recognized by said host strain and direct expression of said fusion protein, and 
recovering the fusion protein so produced, andwherein said host strain preferably is a myeloma cell line 
which produces immunoglobulin light chains and said fusion protein comprises an immunoglobulin heavy 
chain of the class IgM, !gG1 or lgG3, wherein an immunoglobulin-like molecule comprising said fusion 
protein is produced. 

15. A method of producing a fusion protein comprising non-human primate CD4, or fragment thereof which 
binds to HIV or SIV gp120, and an immunoglobulin light chain, wherein the variable region of theim- 
munoglobulin chain has been substituted with non-human primate CD4 t or fragment thereof which binds to 
HiV or SIV gp120 f which comprises: 

cultivating in a nutrient medium under protein -producing conditions, a host strain transformed with a vector 
comprising the nucleic acid molecule of claim 9, said vector further comprising expression signals which 
are recognized by said host strain and direct expression of said fusion protein, and 

recovering the fusion protein so produced, and wherein said host preferably produces immunoglobulin 
heavy chains of the class IgM, lgG1 and igG3 together with said fusion protein to give an immunoglobulin- 
like molecule which binds to HIV or SIV gp120. 

16. Substantially pure non-human primate CD4, especially a substantially pure non-human primate CD4, 
wherein said non-human primate is the rhesus monkey, comprising the following amino acid sequence: 
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MetAsnArgGlylleProPheArgHisLeuLeuLeuValLeuGlnLeuAiaLeuLeuPro 

AlaValThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 

CysThrAlaSerGlnLysLysAsnThrGlnPheHisTrpLysAsnSerAsnGlnlleLys 

HeLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAspArgAla 

AspSerArgLysSerLeuTrpAspGlnGiyCysPheSerMetllelleLysAsnLeuLys 

HeGluAspSerAspThrTyrlleCysGluValGluAsnLysLysGluGluValGluLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeuGluGlyGlnSerLeuThr 

LeuThrLeuGluSerProProGlySerSerProSerValLysCysArgSerProGlyGly 

LysAsnlleGlnGlyGlyArgThrlleSerValProGlnLeuGluArgGlnAspSerGly 

ThrTrpThrCysThrValSerGlnAspGlnLysThrValGluPheLyslleAsplleVal 

ValLeuAlaPheGlnLysAlaSerSerThrValTyrLysLysGluGlyGluGlnValGlu 

PheSerPheProLeuAiaPheThrLeuGluLysLeuThrGlySerGlyGluLeuTrpTrp 

GInAlaGluArgAlaSerSerSerLysSerTrplleThrPheAspLeuLysAsnLysGlu 

ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 

HisLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAsnLeuThrLeuAla 

LeuGluAlaLysThrGlyLysLeuHisGlnGluValAsnLeuValValMetArgAlaThr 

GlnPheGlnGluAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuThrLeu 

SerLeuLysLeuGluAsnLysGlyAlaThrValSerLysGlnAlaLysAlaVaiTrpVal 

LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 

GluSerAsnlleLysValValProThrTrpProThrProValGlnProMetAlaLeulle 

ValLeuGlyGlyValAlaGlyLeuLeuLeuPheThrGlyLeuGlyllePhePheCysVal 

ArgCysArgHisArgArgArgGlnAlaGluArgMetSerGlnlleLysArgLeuLeuSer 

GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProlle, 

or a substantially pure non-human CD4, wherein said non-human primate is the chimpanzee, comprising the 
following amino acid sequence. 
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MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGtnLeuAlaLeuLeuPro 
AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 
CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGlnThrLys 
lleLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeullelleLysAsnLeuLys 
HeGluAspSerAspThrTyrlleCysGluValGlyAspGlnLysGluGluValGlnLeu 
LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeuGlnGlyGlnSerLeuThr 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 
LysAsnlleGlnGlyGlyLysThrLeuSerValSerGlnLeuGluLeuGlnAspSerGly 
ThrTrpThrCysThrValLeuGlnAsnGlnLysLysValGluPheLyslleAsplleVal 
ValLeuAlaPheGlnLysAlaSerSerlleValTyrLysLysGluGlyGluGlnValGlu 
^PheSerPheProLeuAlaPheThrValGluLysLeuThrGlySerGfyGluLeutrpTrp 
GlnAlaGluArgAlaSerSerSerLysSerTrplleThrPheAspLeuLysAsnLysGlu 
ValSerValLysArgVaJThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 
HisLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAsnLeuThrLeuAla 
LeuGluAlaLysThrGlyLysLeuHisGlnGluValAsnLeuValValMetArgAlaThr 
GInLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 
SerLeuLysLeuGluAsnLysGluAlaLysValSedysArgGluLysAlaValTrpVal 
LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 
GluSerAsnlleLysValLeuProThrTrpSerThrProValGlnProMetAlaLeulle 
ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 
ArgCysArgHisArgArgArgGlnAlaGlnArgMetSerGlnlleLysArgLeuLeuSer 
GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProlle; 

or the glycosylated derivative thereof, or a substantially pure non-human C04 comprising the following 
amino acid sequence: 
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MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 
AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 
CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGln-@-Lys 
HeLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArg-#- 
AspSerArgArgSerLeuTrpAspGlnGlyAsnPhe-$-LeullelleLysAsnLeuLys 
HeGluAspSerAspThrTyrlleCysGluValGlyAspGlnLysGluGluValGlnLeu 
LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeuGlnGlyGlnSerLeuThr 
LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 
LysAsnlleGlnGlyGlyLysThrLeuSerValSerGlnLeuGluLeuGlnAspSerGly 
ThrTrpThrCysThrValLeuGlnAsnGlnLysLysValGluPheLyslleAsplleVal 
ValLeuAlaPheGlnLysAlaSerSerlleValTyrLysLysGluGlyGluGlnValGlu 
PheSerPheProLeuAlaPheThrValGlul^euThrGiySeTGIyGluLeuTrp'Tfp 
GlnAlaGluArgAlaSerSerSerLysSerTrplleThrPheAspLeuLysAsnLysGlu 
ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 
HisLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAsnLeuThrLeuAla 
LeuGluAlaLysThrGlyLysLeuHisGlnGluValAsnLeuValValMetArgAlaThr 
GInLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 
SerLeuLysLeuGluAsnLysGluAlaLysValSerLysArgGluLysAlaValTrpVal 
LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 
GluSerAsnlleLysValLeuProThrTrpSerThrProValGlnProMetAlaLeulle 
ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 
36 ArgCysArgHisArgArgArgGlnAla-%-ArgMetSerGlnlleLysArgLeuLeuSer 
GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProlle, 

40 wherein 

*@- is Thr or lie, 
is Val or Ala. 
-S- is Thr or Pro, and 
-%- is Gin or Glu; 

45 or the glycosylated derivative thereof, or a substantially pure non-human CD4 comprising the following 
amino acid sequence: 
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MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 

CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGln-<^-Lys 

HeLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArg-#- 

AspSerArgArgSerLeuTrpAspGlnGlyAsnPhe-$-LeullelleLysAsnLeuLys 

HeGluAspSerAspThrTyrlleCysGluValGlyAspGlnLysGluGluValGlnLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeu 

wherein 

-@- is Thr or lie, 

is Val or Ala, and 
-$• is Thr or Pro; or 
the glycosylated derivative thereof. 

17. A~gp120 binding molecule comprising the'following amino acid sequence: 

MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuTrir 

CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGln-@-Lys 

HeLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgAla 

AspSerArgArgSerLeuTrpAspGlnGlyAsnPhe-$-LeullelleLysAsnLeuLys 

HeGluAspSerAspThrTyrlleCysGluValGluAspGlnLysGluGluValGlnLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeuGlnGlyGlnSerLeuThr 

LeuThrLeuGluSerProProGlySerSerProSerValGlnCysArgSerProArgGly 

LysAsnlleGlnGlyGlyLysThrLeuSerValSerGlnLeuGluLeuGlnAspSerGly 

ThrTrpThrCysThrValLeuGlnAsnGlnLysLysValGluPheLyslleAsplleVal 

ValLeuAlaPheGlnLysAlaSerSerlleValTyrLysLysGluGlyGluGlnValGlu 

PheSerPheProLeuAlaPheThrValGluLysLeuThrGlySerGlyGluLeuTrpTrp 

GInAlaGluArgAlaSerSerSerLysSerTrplleThrPheAspLeuLysAsnLysGlu 

ValSerValLysArgValThrGlnAspProLysLeuGlnMetGlyLysLysLeuProLeu 

HisLeuThrLeuProGlnAlaLeuProGlnTyrAlaGlySerGlyAsnLeuThrLeuAla 

LeuGluAlaLysThrGlyLysLeuHisGlnGluValAsnLeuValValMetArgAlaThr 
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GInLeuGlnLysAsnLeuThrCysGluValTrpGlyProThrSerProLysLeuMetLeu 

SerLeuLysLeuGluAsnLysGluAlaLysValSerLysArgGluLysAlaValTrpVal 

LeuAsnProGluAlaGlyMetTrpGlnCysLeuLeuSerAspSerGlyGlnValLeuLeu 

GluSerAsnlleLysValLeuProThrTrpSerThrProValGlnProMetAlaLeulle 

ValLeuGlyGlyValAlaGlyLeuLeuLeuPhelleGlyLeuGlyllePhePheCysVal 

ArgCysArgHisArgArgArgGlnAIaGluArgMetSerGlnlleLysArgLeuLeuSer 

GluLysLysThrCysGlnCysProHisArgPheGlnLysThrCysSerProlle, 

wherein 

-@- is Thr or He, and 

-$- is Thr or Pro; or 

the glycosylated derivative thereof; 

with the proviso that at least one of -@- and -$- is Thr, 

or comprising the following amino acid sequence: 

MetAsnArgGlyValProPheArgHisLeuLeuLeuVaiLeuGlnLeuAlaLeuLeuPro 

AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 

CysThrAlaSerGlnLysLysSerlleGlnPheHisTipLysAsnSerAsnGln^-Lys 

HeLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgAla 

AspSerArgArgSerLeuTrpAspGInGlyAsnPhe-$-LeullelleLysAsnLeuLys 

HeGluAspSerAspThrTyrlleCysGluValGluAspGlnLysGluGluValGlnLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeu 

wherein 

-@- is Thr or He, and 

-S- is Thr or Pro; or 

the glycosylated derivative thereof; 

with the proviso that at least one of -@- and is Thr, 

and wherein the gp120 binding molecule is preferably linked to a cytotoxic polypeptide, radiolabeled or 
NMR imaging agent 

18. A non-human primate, preferably rhesus monkey or the chimpanzee, CD4 fragment which is capable of 
binding to HIV or SIV gp120, which preferably is soluble in aqueous solution. 

19. The non-human primate CD4 fragment of claim 18, wherein said non-human primate is the rhesus 
monkey, comprising the following amino acid sequence: 

MetAsnArgGlylleProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

AlaValThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGIuLeuThr 

CysThrAlaSerGlnLysLysAsnThrGlnPheHisTrpLysAsnSerAsnGlnlleLys 

HeLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAspArgAla 

AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerMetllelleLysAsnLeuLys 

HeGluAspSerAspThrTyrlleCysGluValGluAsnLysLysGIuGluValGluLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeu, 

or wherein said non-human primate is the chimpanzee, comprising the following amino acid sequence: 
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MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 

CysThrAlaSerGlnLysLysSerlleGlnPheHisTrpLysAsnSerAsnGlnThrLys 

HeLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 

AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeullelleLysAsnLeuLys 

HeGluAspSerAspThrTyrlleCysGluValGlyAspGlnLysGluGluValGlnLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeu; 

or the glycosylated derivative thereof. 

20. A fusion protein comprising non-human primate CD4, or fragment thereof which is capable of binding to 
HIV or SIV gp120, fused at the C-terminus to a second protein which comprises an immunoglobulin heavy 
chain of the class IgM, lgG1 or lgG3, wherein the variable region of said heavy chain immunoglobulin has 
been replaced with CD4, or HIV gp120-binding fragment thereof, which fusion protein preferably is 
detectably labeled, or linked to a cytotoxic polypeptide, preferably comprising ncm or diphtheria toxin, 
radiolabel or NMR imaging agent. 

21. A fusion protein comprising non-human primate CD4, or fragment thereof which binds to HIV or SIV 
gp120, fused at the terminus to a second protein comprising an immunoglobulin light chain wherein the 
variable region has been deleted, which preferably is detectably labeled or linked to a cytotoxic polypep- 
tide, especially comprising ricin or diphtheria toxin, radiolabel or NMR imaging agent. 

22. The fusion protein of claim 19 or 20. wherein said CD4 fragment is derived from the rhesus monkey, 
comprising the following amino acid sequence: 

MetAsnArgGlylleProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

AlaValThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 

CysThrAlaSerGlnLysLysAsnThrGlnPheHisTrpLysAsnSerAsnGlnlleLys 

HeLeuGlylleGlnGlyLeuPheLeuThrLysGlyProSerLysLeuSerAspArgAla 

AspSerArgLysSerLeuTrpAspGlnGlyCysPheSerMetllelleLysAsnLeuLys 

NeGluAspSerAspThrTyrlleCysGluValGluAsnLysLysGluGluValGluLeu 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeu, or 

wherein said CD4 fragment is derived from the chimpanzee, comprising the following amino acid sequence: 

MetAsnArgGlyValProPheArgHisLeuLeuLeuValLeuGlnLeuAlaLeuLeuPro 

AlaAlaThrGlnGlyLysLysValValLeuGlyLysLysGlyAspThrValGluLeuThr 

CysThrAlaSerGlnLysLysSerlleGlnPheHj8TrpLysAsnSerAsnGlnThrLys 

HeLeuGlyAsnGlnGlySerPheLeuThrLysGlyProSerLysLeuAsnAspArgVal 

AspSerArgArgSerLeuTrpAspGlnGlyAsnPheThrLeullelleLysAsnLeuLys 

HeGluAspSerAspTrirTyrlleCysGluValGlyAspGlnLysGluGluValGlnL^u 

LeuValPheGlyLeuThrAlaAsnSerAspThrHisLeuLeu. 

23. An immunoglobulin-like molecule, comprising the fusion protein of claim 19 and an immunoglobulin light 
chain, which immunoglobulin-like molecule preferably is detectably labelled or linked to a cytotoxic 
polypeptide, radiolabel or NMR imaging agent. 

24. An immunoglobulin-like molecule comprising the fusion protein of claim 21 and an immunoglobulin 
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heavy chain of the class IgM, lgG1 or IgG3, which immunoglobulin-like molecule preferably is detectably 
labeled or wherein said fusion protein is linked to a cytotoxic polypeptide, radiolabel or NMR imaging agent 
25.. A non-human primate CD4 molecule, or an HIV or SIV gp120 binding fragment thereof, linked to a 
cytotoxic polypeptide, radiolabel or NMR imaging agent. 
6 26. A complex, comprising 

a) HIV or SIV gp120 and 

b) substantially pure non-human primate CD4, or an HIV or SIV gp120 binding non-human primate CD4 
fragment, or an HIV or SIV gp120 binding non-human primate CD4 soluble fragment, or the fusion 
protein of claim 20 or 21, or the gp120 binding molecule of claim 17. 

w 27. The complex of claim 26, wherein said gp120 is a part of an HIV or SIV, is expressed on the surface of 
an HIV or SIV-infected cell or is present in solution. 

28. A method for the detection of HIV or SIV gp120 in a sample, comprising 

(a) contacting a sample suspected of containing HIV or SIV gp120 with the fusion protein of claim 20 or 
21, and 

75 (b) detecting whether a complex is formed. 

29. A method for the detection of HIV or SIV gp120 in a sample, comprising 

(a) contacting a sample suspected of containing HIV or SIV gp120 with non-human primate CD4, or 
fragment thereof jwhich is capable of binding to HIV or SIV gp120 ( and wherein preferably said non- 
human primate CD4 or fragment thereof is detectably labeled, 
20 (b) detecting whether a complex has formed. 

30. A method for the detection of HIV or SIV gp120 in a sample, comprising 

(a) contacting a sample suspected of containing HIV or SIV gp120 with the gp120 binding molecule of 
claim 17, which preferably is detectably labeled; and 

(b) detecting whether a complex has formed. 

25 31. A pharmaceutical composition comprising a therapeutically effective amount of substantially pure non- 
human primate CD4 or a therapeutically effective amount of a non-human primate CD4 fragment which is 
capable of binding to HIV or SIV gpl20. and preferably soluble in aqueous solution, or a therapeutically 
effective amount of the gp120 binding molecule of claim 17; and a pharmaceutical^ acceptable carrier. 
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SEQ 10 NO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 

STRAND EDNESS : 
TOPOLOGY: 



Nucleotide with corresponding protein 
1374 bases 

Single 
Linear 



FEATURES: 



None 



ATG AAC CGG GGA ATC CCT TTT AGG CAC TTG CTT CTG GTG CTG CAA CTG 48 
Met Asn Arg Gly lie Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 
5 10 15 

GCG CTA CTC CCA GCA GTC ACC CAG GGA AAG AAA GTG GTG CTG GGC AAG 96 
Ala Leu Leu Pro Ala Val Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

AAA GGG GAT ACA GTG GAA CTG ACC TGT ACA GCT TCG CAG AAG AAG AAC 144 
Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Asn 
35 40 45 

ACA CAA TTC CAC TGG AAA AAC TCC AAC CAG ATA AAG ATT CTG GGA ATT 192 
Thr Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys lie Leu Gly lie 
50 55 60 

CAG GGT CTC TTC TTA ACT AAA GGT CCA TCC AAG CTG AGC GAT CGT GCT 240 
Gin Gly Leu Phe Leu Thr Lys Gly Pro Ser Lys Leu Ser Asp Arg Ala 
65 70 75 80 

GAC TCA AGA AAA AGC CTT TGG GAC CAA GGA TGC TTT TCC ATG ATC ATC 288 
Asp Ser Arg Lys Ser Leu Trp Asp Gin Gly Cys Phe Ser Met lie lie 
85 90 95 

AAG AAT CTT AAG ATA GAA GAC TCA GAT ACT TAC ATC TGT GAA GTG GAG 336 
Lys Asn Leu Lys lie Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 HO 

AAC AAG AAG GAG GAG GTG GAA TTG CTG GTG TTC GGA TTG ACT GCC AAC 384 
Asn Lys Lys Glu Glu Val Glu Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

TCT GAC ACC CAC CTG CTT GAG GGG CAA AGC CTG ACC CTG ACC TTG GAG 432 
Ser Asp Thr His Leu Leu Glu Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

AGC CCC CCT GGT AGT AGC CCC TCA GTG AAA TGT AGG AGT CCA GGG GGT 480 
Ser Pro Pro Gly Ser Ser Pro ser Val Lys Cys Arg Ser Pro Gly Gly 
145 150 155 160 

AAA AAC ATA CAG GGG GGG AGG ACC ATC TCT GTG CCT CAG CTG GAG CGC 528 
Lys Asn He Gin Gly Gly Arg Thr He Ser Val Pro Gin Leu Glu Arg 
165 170 175 
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CAG GAT AGT GGC ACC TGG ACA TGC ACC GTC TCG CAG GAC CAG AAG ACG 576 
Gin Asp Ser Gly Thr Trp Thr cys Thr Val Ser Gin Asp Gin Lys Thr 
180 185 190 

GTG GAG TTC AAA ATA GAC ATC GTG GTG CTA GCT TTC CAG AAG GCC TCC 624 
Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

AGC ACA GTC TAT AAG AAA GAG GGG GAA CAG GTG GAG TTC TCC TTC CCA 672 
Ser Thr Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 

CTC GCC TTT ACA CTT GAA AAG CTG ACG GGC AGT GGC GAG CTG TGG TGG 720 
Leu Ala Phe Thr Leu Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 

CAG GCG GAG AGG GCC TCC TCC TCC AAG TCT TGG ATT ACC TTC GAC CTG 768 
Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 
245 250 255 

AAG AAC AAG GAA GTG TCT GTA AAA CGG GTT ACC CAG GAC CCC AAG CTC 816 
Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

CAG ATG GGC AAG AAG CTC CCG CTC CAC CTC ACC CTG CCC CAG GCC TTG 864 
Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

CCT CAG TAT GCT GGC TCT GGA AAC CTC ACG CTG GCC CTT GAA GCG AAA 912 
Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

ACA GGA AAG TTG CAT CAG GAA GTG AAC CTC GTG GTG ATG AGA GCC ACT 960 
Thr Gly Lys Leu His Gin Glu Val Asn Leu Val Val Met Arg Ala Thr 
305 310 315 320 

CAG TTC CAG GAA AAT TTG ACC TGT GAA GTG TGG GGA CCC ACC TCC CCT 1008 
Gin Phe Gin Glu Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

AAG CTG ACG CTG AGC TTG AAA CTG GAG AAC AAG GGG GCA ACG GTC TCG 1056 
Lys Leu Thr Leu Ser Leu Lys Leu Glu Asn Lys Gly Ala Thr Val Ser 
340 345 350 

AAG CAG GCG AAG GCG GTG TGG GTG CTG AAC CCT GAG GCG GGG ATG TGG 1104 
Lys Gin Ala Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

CAG TGT CTG CTG AGT GAC TCG GGA CAG GTC CTG CTA GAA TCC AAC ATC 1152 
Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 
370 375 380 
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AAG GTT GTG CCC ACA TGG CCC ACC CCG GTG CAG CCA ATG GCC CTG ATT 1200 
Lys Val Val Pro Thr Trp Pro Thr Pro Val Gin Pro Met Ala Leu He 
385 390 395 400 

GTG CTG GGG GGC GTT GCG GGC CTC CTG CTT TTC ACT GGG CTA GGC ATC 1248 
Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe Thr Gly Leu Gly He 

405 410 415 

TTC TTC TGT GTC AGG TGC CGG CAT CGA AGG CGT CAA GCA GAG CGG ATG 1296 
Phe Phe Cys val Arg Cys Arg His Arg Arg Arg Gin Ala Glu Arg Met 
420 425 430 

TCT CAG ATC AAG AGA CTC CTC AGT GAA AAG AAG ACC TGC CAG TGC CCT 1344 
Ser Gin He Lys Arg Leu Leu Ser Glu Lys Lys Thr Cys Gin Cys Pro 
435 440 445 

CAC CGG TTT CAG AAG ACA TGT AGC CCC ATT 1374 
His Arg Phe Gin Lys Thr Cys Ser Pro He 
450 455 
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SEQ ID NO.: 2 
SEQUENCE TYPE: Nucleotide with corresponding protein 
SEQUENCE LENGTH: 401 bases 

STRANDEDNESS: Single 
TOPOLOGY: Linear 

FEATURES: None 

ATG AAC CGG GGA ATC CCT TTT AGG CAC TTG CTT CTG GTG CTG CAA CTG 48 
Met Asn Arg Gly lie Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 
5 10 15 

GCG CTA CTC CCA GCA GTC ACC CAG GGA AAG AAA GTG GTG CTG GGC AAG 96 
Ala Leu_Leu Pro Ala val.Thr GliuGly Lys Lys Val Val Leu Gly Lys 
20 25 30 

AAA GGG GAT ACA GTG GAA CTG ACC TGT ACA GCT TCG CAG AAG AAG AAC 144 
Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Asn 
35 40 45 

ACA CAA TTC CAC TGG AAA AAC TCC AAC CAG ATA AAG ATT CTG GGA ATT 192 
Thr Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys lie Leu Gly lie 
50 55 60 

CAG GGT CTC TTC TTA ACT AAA GGT CCA TCC AAG CTG AGC GAT CGT GCT 240 
Gin Gly Leu Phe Leu Thr Lys Gly Pro Ser Lys Leu Ser Asp Arg Ala 
65 70 75 80 

GAC TCA AGA AAA AGC CTT TGG GAC CAA GGA TGC TTT TCC ATG ATC ATC 288 
Asp Ser Arg Lys Ser Leu Trp Asp Gin Gly Cys Phe Ser Met lie lie 
85 90 95 

AAG AAT CTT AAG ATA GAA GAC TCA GAT ACT TAC ATC TGT GAA GTG GAG 336 
Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 110 

AAC AAG AAG GAG GAG GTG GAA TTG CTG GTG TTC GGA TTG ACT GCC AAC 384 
Asn Lys Lys Glu Glu Val Glu Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

TCT GAC ACC CAC CTG CTT 402 
Ser Asp Thr His Leu Leu 
130 
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SEQ ID NO.: 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 

STRAND EDNESS : 
TOPOLOGY: 



Nucleic acid with corresponding protein 
1374 bases 

Single 
Linear 



FEATURES: 



None 



ATG AAC CGG GGA GTC CCT TTT AGG CAC TTG CTT CTG GTG CTG CAA CTG 
Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 
5 10 15 

GCA CTC CTC CCA GCA GCC ACT CAG GGA AAG AAA GTG GTG CTG GGC AAG 
Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys ValTVai Leu Gly Lys 
20 25 30 

AAA GGG GAC ACA GTG GAA CTG ACC TGT ACA GCT TCC CAG AAG AAG AGC 
Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

ATA CAA TTC CAC TGG AAA AAC TCC AAC CAG ACA AAG ATT CTG GGA AAT 
lie Gin Phe His Trp Lys Asn Ser Asn Gin Thr Lys He Leu Gly Asn 
50 55 60 



CAG GGC TCC TTC TTA ACT AAA GGT CCA TCC AAG CTG AAT GAT CGC GTT 
Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Val 
65 70 75 



80 



GAC TCA AGA AGA AGC CTT TGG GAC CAA GGA AAC TTT ACC CTG ATC ATC 
Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Thr Leu He He 
85 90 95 

AAG AAT CTT AAG ATA GAA GAC TCA GAT ACT TAC ATC TGT GAA GTG GGG 
Lys Asn Leu Lys lie Glu Asp Ser Asp Thr Tyr He Cys Glu Val Gly 
100 105 HO 

GAC CAG AAG GAG GAG GTG CAA TTG CTA GTG TTC GGA TTG ACT GCC AAC 
Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

TCT GAC ACC CAC CTG CTT CAG GGG CAG AGC CTG ACC CTG ACC TTG GAG 
Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

AGC CCC CCT GGT AGT AGC CCC TCA GTG CAA TGT AGG AGT CCA AGG GGT 
Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

AAA AAC ATA CAG GGG GGG AAG ACC CTC TCC GTG TCT CAG CTG GAG CTC 
Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 
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CAG GAT AGT GGC ACC TGG ACA TGC ACT GTC TTG CAG AAC CAG AAG AAA 576 
Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 185 190 

GTG GAG TTC AAA ATA GAC ATC GTG GTG CTA GCT TTC CAG AAG GCC TCC 624 
Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

AGC ATA GTC TAT AAG AAA GAG GGG GAA CAG GTG GAG TTC TCC TTC CCA 672 
Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 

CTC GCC TTT ACA GTT GAA AAG CTG ACG GGC AGT GGC GAG CTG TGG TGG 720 
Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 

225 230 235 _ _ 240 

CAG GCG GAG AGG GCT TCC TCC TCC AAG TCT TGG ATC ACC TTT GAC CTG 768 
Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 
245 250 255 

AAG AAC AAG GAA GTG TCT GTA AAA CGG GTT ACC CAG GAC CCT AAG CTC 816 
Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

CAG ATG GGC AAG AAG CTC CCG CTC CAC CTC ACC CTG CCC CAG GCC TTG 864 
Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

CCT CAG TAT GCT GGC TCT GGA AAC CTC ACC CTG GCC CTT GAA GCG AAA 912 
Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

ACA GGA AAG TTG CAT CAG GAA GTG AAC CTC GTG GTG ATG AGA GCC ACT 960 
Thr Gly Lys Leu His Gin Glu Val Asn Leu Val Val Met Arg Ala Thr 
305 310 315 320 

CAG CTC CAG AAA AAT TTG ACC TGT GAG GTG TGG GGA CCC ACC TCC CCT 1008 
Gin Leu Gin Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

AAG CTG ATG CTG AGC TTG AAA CTG GAG AAC AAG GAG GCA AAG GTC TCG 1056 
Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 
340 345 350 

AAG CGG GAG AAG GCG GTG TGG GTG CTG AAC CCT GAG GCG GGG ATG TGG 1104 
Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

CAG TGT CTG CTG AGT GAC TCG GGA CAG GTC CTG CTG GAA TCC AAC ATC 1152 
Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 
370 375 380 
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AAG GTT CTG CCC ACA TGG TCC ACC CCG GTG CAG CCA ATG GCC CTG ATT 1200 
Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu He 
385 390 395 400 

GTG CTG GGG GGC GTC GCC GGC CTC CTG CTT TTC ATT GGG CTA GGC ATC 1248 
Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe He Gly Leu Gly He 
405 410 415 

TTC TTC TGT GTC AGG TGC CGG CAC CGA AGG CGC CAA GCA CAG CGG ATG 1296 
Phe Phe Cys Val Arg cys Arg His Arg Arg Arg Gin Ala Gin Arg Met 
420 425 430 

TCT CAG ATC AAG AGA CTC CTC AGT GAG AAG AAG ACC TGC CAG TGC CCT 1344 
Ser Gin He Lys Ara Leu Leu^Ser Glu Lyi^.Lys Thr Cys Gin Cys Pro 

435 " 440 445 

CAC CGG TTT CAG AAG ACA TGT AGC CCC ATT 1374 
His Arg Phe Gin Lys Thr Cys Ser Pro He 
450 455 
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SEQ ID NO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 

' STRANDEDNESS: 
TOPOLOGY: 



Nucleic acid with corresponding protein 
402 bases 

Single 
Linear 



FEATURES: 



None 



ATG AAC CGG GGA GTC CCT TTT AGG CAC TTG CTT CTG GTG CTG CAA CTG 48 
Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

GCA CTC CTC CCA GCA GCC ACT CAG GGA AAG AAA GTG GTG- CTG-GGC AAG 9.6 
Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

AAA GGG GAC ACA GTG GAA CTG ACC TGT ACA GCT TCC CAG AAG AAG AGC 144 
Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

ATA CAA TTC CAC TGG AAA AAC TCC AAC CAG ACA AAG ATT CTG GGA AAT 192 
lie Gin Phe His Trp Lys Asn Ser Asn Gin Thr Lys lie Leu Gly Asn 
50 55 60 

CAG GGC TCC TTC TTA ACT AAA GGT CCA TCC AAG CTG AAT GAT CGC GTT 240 
Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Val 
65 70 75 80 

GAC TCA AGA AGA AGC CTT TGG GAC CAA GGA AAC TTT ACC CTG ATC ATC 288 
Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe thr Leu lie lie 
85 90 95 

AAG AAT CTT AAG ATA GAA GAC TCA GAT ACT TAC ATC TGT GAA GTG GGG 336 
Lys Asn Leu Lys lie Glu Asp Ser Asp Thr Tyr lie cys Glu Val Gly 
100 105 110 

GAC CAG AAG GAG GAG GTG CAA TTG CTA GTG TTC GGA TTG ACT GCC AAC 384 
Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

TCT GAC ACC CAC CTG CTT 402 
Ser Asp Thr His Leu Leu 
130 
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SEQ ID NO* : 5 
SEQUENCE TYPE: Nucleic acid 
SEQUENCE LENGTH: 1374 bases 

STRANDEDNES S : S ing le 
TOPOLOGY: Linear 

FEATURES: Y is C or T 
M is A or C 
S is G or C 



ATGAACCGGG 


GAGTCCCTTT 


TAGGCACTTG 


CTTCTGGTGC 


TGCAACTGGC 


ACTCCTCCCA 


60 


GCAGCCACTC 


AGGGAAAGAA 


AGTGGTGCTG 


GGCAAGAAAG 


GGGACACAGT 


GGAACTGACC 


120 


TGTACAGCTT 


CCCAGAAGAA 


GAGCATACAA 


TTCCACTGGA 


AAAACTCCAA 


CCAGAYAAAG 


180 


ATTCTGGGAA ATCAGGGCTC 


CfPTCTTAACT 


AAAGGTCCAT 


CCAAGCTGAA 


TGATCGCGYT 


240 


GACTCAAGAA 


GAAGCCTTTG 


GGACCAAGGA 


AACTTTMCCC 


TGATCATCAA 


GAATCTTAAG 


300 


ATAGAAGACT 


CAGATACTTA 


CATCTGTGAA 


GTGGGGGACC 


AGAAGGAGGA 


GGTGCAATTG 


360 


CTAGTGTTCG 


GATTGACTGC 


CAACTCTGAC 


ACCCACCTGC 


TTCAGGGGCA 


GAGCCTGACC 


420 


CTGACCTTGG 


AGAGCCCCCC 






JVVTGTAGGAG 


TCCAAGGGGT 


480 


AAAAACATAC 


AGGGGGGGAA 


GACCCTCTCC 


GTGTCTCAGC 


TGGAGCTCCA 


GGATAGTGGC 


540 


ACCTGGACAT 


GCACTGTCTT 


GCAGAACCAG 


AA6AAAGTGG 


AGTTCAAAAT 


AGACATCGTG 


600 


GTGCTAGCTT 


TCCAGAAGGC 


CTCCAGCATA 


GTCTATAAGA 


AAGAGGGGGA 


ACAGGTGGAG 


660 


TTCTCCTTCC 


CACTCGCCTT 


TACAGTTGAA 


AAGCTGACGG 


GCAGTGGCGA 


GCTGTGGTGG 


720 


CAGGCGGAGA 


GGGCTTCCTC 


CTCCAAGTCT 


TGGATCACCT 


TTGACCTGAA 


GAACAAGGAA 


780 


GTGTCTGTAA 


AACGGGTTAC 


CCAGGACCCT 


AAGCTCCAGA 


TGGGCAAGAA 


GCTCCCGCTC 


840 


CACCTCACCC 


TGCCCCAGGC 


CTTGCCTCAG 


TATGCTGGCT 


CTGGAAACCT 


CACCCTGGCC 


900 


CTTGAAGCGA 


AAACAGGAAA 


GTTGCATCAG 


GAAGTGAACC 


TCGTGGTGAT 


GAGAGCCACT 


960 


CAGCTCCAGA 


AAAATTTGAC 


CTGTGAGGTG 


TGGGGACCCA 


CCTCCCCTAA 


GCTGATGCTG 


1020 


AGCTTGAAAC 


TGGAGAACAA 


GGAGGCAAAG 


GTCTCGAAGC 


GGGAGAAGGC 


GGTGTGGGTG 


1080 


CTGAACCCTG 


AGGCGGGGAT 


GTGGCAGTGT 


CTGCTGAGTG 


ACTCGGGACA 


GGTCCTGCTG 


1140 


GAATCCAACA TCAAGGTTCT 


GCCCACATGG 


TCCACCCCGG 


TGCAGCCAAT 


GGCCCTGATT 


1200 


GTGCTGGGGG 


GCGTCGCCGG 


CCTCCTGCTT 


TTCATTGGGC 


TAGGCATCTT 


CTTCTGTGTC 


1260 
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AGG7GCCG6C ACCGAAGGCG CCAAGCASAG CGGATGTCTC AGATCAAGAG ACTCCTCAGT 1320 
GAGAAGAAGA CCTGCCAGTG CCCTCACCGG TTTCAGAAGA CATGTAGCCC CATT 1376 
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SEQ ID NO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



6 

Nucleic acid 
1377 bases 



STRAND EDNESS : 
TOPOLOGY: 



Single 
Linear 



FEATURES: 



Y is C or T 
M is A or C 



ATGAACCGGG 


GAGTCCCTTT 


TAGGCACTTw Cri 1 LTXviw 1. viv* 


X uWiAV> x uwv 


GCTCCTCCCA 


60 


GCAGCCACTC 


AGGGAAAGAA 


AGTGGTGCTG GGCAAAAAAG 


uuwA X Awiv X 


\7\3AAw X wAWW 


A A V 


TGTACAGCTT 


CCCAGAAGAA 


GAG CAT A CAA TTCCAwvun 


AAAAwlbUAA 




180 


ATTCTGGGAA 


ATCAGGG CTC 


CTTCTTAAwT AAAiiwXwwiX 




TGATCGCGCT 


240 


GACTCAAGAA 


GAAGCCTTTG 




TGATCATCAA 


GAATCTTAAG 


300 


ATAGAAGACT 


CAGATACTTA 




AGAAGGAGGA 


GGTGCAATTG 


360 


CTAGTG TTCG 


(?A1 1 VjAU 1 V»W 




TTCAGGGGCA 


GAGCCTGACC 


420 








AATGTAGGAG 


TCCAAGGGGT 


480 


AAAAACATAC 


AGGGGGGGAA 


GACCCTCTCC GTGTCTCAGC 


TGGAGCTCCA 


GGATAGTGGC 


540 


ACCTGGACAT 


GCACTGTCTT 


GCAGAACCAG AAGAAGGTGG 


AGTTCAAAAT 


AGACATCGTG 


600 


GTGCTAGCTT 


TCCAGAAGGC 


CTCCAGCATA GTCTATAAGA 


AAGAGGGGGA 


ACAGGTGGAG 


660 


TTCTCCTTCC 


CACTCGCCTT 


TACAGTTGAA AAGCTGACGG 


GCAGTGGCGA 


GCTGTGGTGG 


720 


CAGGCGGAGA 


GGGCTTCCTC 


CTCCAAGTCT TGGATCACCT 


TTGACCTGAA 


GAACAAGGAA 


780 


GTGTCTGTAA 


AACGGGTTAC 


CCAGGACCCT AAGCTCCAGA 


TGGGCAAGAA 


GCTCCCGCTC 


840 


CACCTCACCC 


TGCCCCAGGC 


CTTGCCTCAG TATGCTGGCT 


CTGGAAACCT 


CACCCTGGCC 


900 


CTTGAAGCGA 


AAACAGGAAA 


GTTGCATCAG GAAGTGAACC 


TGGTGGTGAT 


GAGAGCCACT 


960 


CAGCTCCAGA 


AAAATTTGAC 


CTGTGAGGTG TGGGGACCCA 


CCTCCCCTAA 


GCTGATGCTG 


1020 


AGCTTGAAAC 


TGGAGAACAA 


GGAGGCAAAG GTCTCGAAGC 


GGGAGAAGGC 


GGTGTGGGTG 


1080 


CTGAACCCTG 


AGGCGGGGAT 


GTGGCAGTGT CTGCTGAGTG 


ACTCGGGACA 


GGTCCTGCTG 


1140 


GAATCCAACA 


TCAAGGTTCT 


GCCCACATGG TCCACCCCGG 


TGCAGCCAAT 


GGCCCTGATT 


1200 
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GTGCTGGGGG GCGTCGCCGG CCTCCTGCTT 
AGGTGCCGGC ACCGAAGGCG CCAAGCAGAG 
GAGAAGAAGA CCTGCCAGTG CCCTCACCGG 



TTCATTGGGC TAGGCATCTT CTTCTGTGTC 1260 
CGGATGTCTC AGATCAAGAG ACTCCTCAGT 1320 
TTTCAGAAGA CATGTAGCCC CATTTGA 1377 
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SEQ ID NO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



7 

Nucleic acid 
402 bases 



STRANDEDNESS: 
TOPOLOGY: 



Single 
Linear 



FEATURES: 



Y is C or T 
M is A or C 



ATGAACCGGG GAGTCCCTTT TAGGCACTTG CTTCTGGTGC TGCAACTGGC GCTCCTCCCA 60 

GCAGCCACTC AGGGAAAGAA AGTGGTGCTG GGCAAAAAAG GGGATACAGT GGAACTGACC 120 

TGTACAGCTT CCCAGAAGAA GAGCATACAA TTCCACTGGA AAAACTCCAA CCAGAYAAAG 180 

ATTCTGGGAA ATCAGGGCTC CTTCTTAACT AAAGGTCCAT CCAAGCTGAA TGATCGCGCT 240 

GACTCAAGAA GAAGCCTTTG GGACCAAGGA AACTTTMCCC TGATCATCAA GAATCTTAAG 300 

ATAGAAGACT CAGATACTTA CATCTGTGAA GTGGAGGACC AGAAGGAGGA GGTGCAATTG 360 

CTAGTGTTCG GATTGACTGC CAACTCTGAC ACCCACCTGC TT 402 



t 
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SEQ ID NO.: 8 
SEQUENCE TYPE: Protein 
SEQUENCE LENGTH: 458 amino acids 

STRANDEDNESS : S ingle 
TOPOLOGY: Linear 

FEATURES: None 

Met Asn Arc Gly He Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Val Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Asn 
35 40 45 

Thr Gin Phe His Trp Lys Asn Ser Asn Gin He Lys He Leu Gly He 
50 55 60 

Gin Gly Leu Phe Leu Thr Lys Gly Pro Ser Lys Leu Ser Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Lys Ser Leu Trp Asp Gin Gly Cys Phe Ser Met He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 110 

Asn Lys Lys Glu Glu Val Glu Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Glu Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Lys Cys Arg Ser Pro Gly Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Arg Thr He Ser Val Pro Gin Leu Glu Arg 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Ser Gin Asp Gin Lys Thr 
180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Ser Thr Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 

Leu Ala Phe Thr Leu Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 
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Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 
245 250 255 

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

Thr Gly Lys Leu His Gin Glu Val Asn Leu Val Val Met Arg Ala Thr 
305 310 315 320 

SLxl Leu Gin Lys Asn Leu Thr Cys Glu Vai Trp Gly Pro Thr Ser Pro 
325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 
340 345 350 

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 
370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu He 
385 390 395 400 

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe He Gly Leu Gly He 
405 410 415 

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gin Ala Gin Arg Met 
420 425 430 

Ser Gin He Lys Arg Leu Leu Ser Glu Lys Lys Thr Cys Gin Cys Pro 
435 440 445 



His Arg Phe Gin Lys Thr Cys Ser Pro He 
450 455 
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SEQ ID NO.: 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



9 

Protein 

456 amino acids 



STRANDEDNESS : Single 
TOPOLOGY : Linear 

FEATURES: None 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Thr Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Val 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Thr Leu He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Gly 
100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 

Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 
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Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp lie Thr Phe Asp Leu 
245 250 255 

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

Thr Gly Lys Leu His Gin Glu Val Asn Leu Val Val Met Arg Ala Thr 
305 310 315 320 

(ELn Leu Gin Lys Asn Leu^Tttr Cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 
340 345 350 

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn lie 
370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu lie 
385 390 395 400 

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe He Gly Leu Gly He 
405 410 415 

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gin Ala Gin Arg Met 
420 425 430 

Ser Gin He Lys Arg Leu Leu Ser Glu Lys Lys Thr Cys Gin Cys Pro 
435 440 445 

His Arg Phe Gin Lys Thr Cys Ser Pro He 
450 455 
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SEQ ID NO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



10 

Protein 

458 amino acids 



STRANDEDNESS: 
TOPOLOGY : 



Single 
Linear 



FEATURES : 



Glx is Glu or Gin 
Xaa at position 59 
Xaa at position 80 
Xaa at position 93 



is Thr or lie 
is Val or Ala 
is Thr or Pro 



Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 _ 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Xaa Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Xaa 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Xaa Leu He He 
85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Gly 
100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 ' 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 
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Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 

Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 
245 250 255 

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

Thr Gly Lys Leu His GIiTgIu Val Asn Leu Val Vai Met Arg Ala Thr 
305 310 315 320 

Gin Leu Gin Lys Asn Leu Thr cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Srr 
340 345 350 

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 
370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu He 
385 390 395 400 

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe He Gly Leu Gly He 
405 410 415 

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gin Ala Glx Arg Met 
420 425 430 

Ser Gin He Lys Arg Leu Leu Ser Glu Lys Lys Thr Cys Gin Cys Pro 
435 440 445 

His Arg Phe Gin Lys Thr Cys Ser Pro He 
450 455 
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SEQ ID NO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



11 

Protein 

134 amino acids 



STRANDEDNESS: 
TOPOLOGY: 



Single 
Linear 



FEATURES: 



Xaa 
Xaa 
Xaa 



at position 59 is Thr or lie 
at position 80 is Val or Ala 
at position 93 is Thr or Pro 



Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Xca Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Xaa 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Xaa Leu He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Gly 
100 105 HO 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu 



130 
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SEQ ID MO. : 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



12 

Protein 

458 amino acids 



STRANDEDNESS : 
TOPOLOGY: 



Single 
Linear 



FEATURES 



Xaa at position 59 is Thr or He 
Xaa at position 93 is Thr or Pro 



Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 — 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Xaa Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Xaa Leu He lie 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 HO 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 



210 



215 



220 
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Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 

Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 
245 250 255 

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 

275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

Thr Gly Lys Leu His Gin Glu Val -Asn Lsu Val Val Met Arg Ala Thr 
305 310 315 320 

Gin Leu Gin Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 
340 345 350 

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 
370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu He 
385 390 395 400 

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe He Gly Leu Gly He 
405 410 415 

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gin Ala Glu Arg Met 
420 425 430 

Ser Gin He Lys Arg Leu Leu Ser Glu Lys Lys Thr Cys Gin Cys Pro 
435 440 445 

His Arg Phe Gin Lys Thr Cys Ser Pro He 
450 455 
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SEQ ID NO. : 13 
SEQUENCE TYPE: Protein 
SEQUENCE LENGTH: 134 amino acids 

STRANDEDNESS: Single 
TOPOLOGY: Linear 

FEATURES: None 
Met Asn Arg Gly He Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Val Thr Gin Gly Lys Lys Val val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Asn 
35 40 45 

Thr Gin Phe His Trp Lys Asn Ser Asn Gin He Lys He Leu Gly He 
50 55 60 

Gin Gly Leu Phe Leu Thr Lys "Gly Pro Ser Lys Leu Ser Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Lys Ser Leu Trp Asp Gin Gly Cys Phe Ser Met He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 HO 

Asn Lys Lys Glu Glu Val Glu Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu 
130 
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SEQ ID NO. : 14 
SEQUENCE TYPE: Protein 
SEQUENCE LENGTH: 134 amino acids 

STRANDEDNESS : Single 
TOPOLOGY: Linear 

FEATURES: None 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly^Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Thr Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Val 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Thr Leu He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Gly 
100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu 
130 
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SEQ ID NO. : 15 
SEQUENCE TYPE: Protein 
SEQUENCE LENGTH: 134 amino acids 

STRANDEDNESS: Single 
TOPOLOGY : Linear 

FEATURES: Xaa at position 59 is Thr or lie 
Xaa at position 93 is Thr or Pro 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 _ 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Xaa Lys He Leu Gly Asn 
50 55 60 

Gin Gly ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Xaa Leu He lie 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 HO 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu 
130 
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SEQ ID NO.: 
SEQUENCE TYPE: 
SEQUENCE LENGTH: 



16 

Protein 

134 amino acids 



STRANDEDNESS: 
TOPOLOGY : 



Single 
Linear 



FEATURES: None 

Met Asn Arg Gly He Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Val Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly^Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Asn 
35 40 45 

Thr Gin Phe His Trp Lys Asn Ser Asn Gin He Lys He Leu Gly He 
50 55 60 

Gin Gly Leu Phe Leu Thr Lys Gly Pro Ser Lys Leu Ser Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Lys Ser Leu Trp Asp Gin Gly Cys Phe Ser Met He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 HO 

Asn Lys Lys Glu Glu Val Glu Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu 



130 
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SEQ ID NO.: 17 
SEQUENCE TYPE: Protein 
SEQUENCE LENGTH: 134 amino acids 

STRAND EDNESS : Single 
TOPOLOGY: Linear 

FEATURES: None 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin Thr Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Val 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Thr Leu He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Gly 
100 105 HO 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu 
130 



C195-09.WP5 



87 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 
. □ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



